Yu Su

18 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×7NLP×5AI×5ML×4Info Retrieval×2Software Eng.×2Architecture×1Distributed×1

Frequent co-authors

Yu Cui2×

Ruiqing Yue2×

Sicheng Pan2×

Zhuoyu Sun2×

Baohan Huang2×

Haibin Zhang2×

Research Timeline

2026

Spore: Efficient and Training-Free Privacy Extraction Attack on LLMs via Inference-Time Hybrid Probing

The paper introduces extsc{Spore}, a novel, training-free, and highly efficient privacy extraction attack that targets sensitive information stored in the memory of LLM agents during inference, outperforming existing state-of-the-art methods.

From Compression to Accountability: Harmless Copyright Protection for Dataset Distillation

The paper proposes SubPopMark, a novel subpopulation-driven framework that injects harmless, verifiable markers into distilled datasets to prevent copyright infringement and data leakage.

HIDBench: Benchmarking Large Language Models for Host-Based Intrusion Detection

The paper introduces HIDBench, a new benchmark for evaluating LLMs' ability to perform host-based intrusion detection using complex, noisy system logs, finding that model performance degrades significantly with increased data complexity.

VIPER-MCP: Detecting and Exploiting Taint-Style Vulnerabilities in Model Context Protocol Servers

VIPER-MCP is a novel, end-to-end automated framework that detects and dynamically confirms the exploitability of taint-style vulnerabilities in Model Context Protocol (MCP) servers, achieving high-fidelity vulnerability discovery in real-world systems.

In-Context Reward Adaptation for Robust Preference Modeling

The paper proposes In-Context Reward Adaptation, a transformer-based framework that uses in-context learning and auxiliary signals (like human response time) to robustly model diverse and unseen human preferences for better AI alignment.

AGENTCL: Toward Rigorous Evaluation of Continual Learning in Language Agents

The paper introduces AGENTCL, a rigorous evaluation framework that uses controlled task streams to accurately measure an agent's ability to accumulate and reuse knowledge across multiple tasks, thereby addressing limitations in current continual learning benchmarks.

EVA-Net: Subject-Independent EEG Motor Decoding with Video-Derived Motor Priors

EVA-Net proposes a two-stage framework that uses action videos as semantic priors to achieve strong subject-independent EEG motor decoding, significantly outperforming text-based methods.

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

The paper introduces SkillHarm, a comprehensive benchmark and automated framework for evaluating skill-based attacks across the entire agent skill-use lifecycle, demonstrating that current agents remain highly vulnerable to both fixed-payload and self-mutating poisoning attacks.

Resonant Context Anchoring: Decoupling Attention Routing and Signal Gain at Inference Time

The paper proposes Resonant Context Anchoring (RCA), a lightweight, training-free method that enhances factual faithfulness in LLMs by dynamically amplifying the signal of external context evidence during inference.

SkillGuard: A Permission Framework for Agent Skills

SkillGuard introduces a novel, skill-centric permission framework to secure LLM agent skill ecosystems by jointly regulating both context influence and runtime action side effects.

BEATS: Bootstrapping E-commerce Attribute Taxonomies for Search through Iterative Human-AI Collaboration

The paper presents BEATS, a human-in-the-loop LLM framework for bootstrapping product attribute taxonomies from scratch.

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

This paper proposes a preconditioning layer for stable weight conditioning in LLM training.

Protecting K-Nearest Neighbor Queries from Location Inference Attacks

This paper identifies two novel location inference attacks against k-nearest neighbor queries (kNNQ) and proposes DPRS, a differential privacy framework that effectively protects location privacy while maintaining high query utility.

KBSpec: LLM-driven Formal Specification Generation with Evolving Domain Knowledge Base

Proposed method, KBSpec, uses external and internal knowledge to improve formal specification generation by LLMs, increasing verification pass rates and producing more high-completeness specifications.

MMed-Bench-IR: A Heterogeneous Benchmark for Multilingual Medical Information Retrieval

This paper introduces MMed-Bench-IR, a benchmark for multilingual medical retrieval in clinical settings, evaluating cross-lingual alignment, concept discrimination, and evidence retrieval.

CODA: Algorithm-Hardware Co-design for Edge Video Diffusion via NMP-Enabled Compute-Cache Operator Disaggregation

The paper proposes CODA, an algorithm-hardware co-designed architecture for deploying Video Diffusion Models on edge devices, achieving up to 1.80x speedup and 1.74x energy efficiency.

Learning Agile Navigation in Crowded Environments for Quadruped Robots

This paper proposes VOP-Nav, a novel navigation system for quadruped robots that combines the geometric safety of Velocity Obstacles with the agile adaptability of end-to-end learning.

Refusal is Not Safety! Benchmarking Latent Safety Risks of LLM-Driven Content Humorization

This paper explores safety risks in humorization of large language models (LLMs) and introduces HumorSafe framework for evaluating latent safety risks.

Highlighted terms show continued research focus across papers

Papers

cs.CREmpiricalRecentJul 17, 2026

Refusal is Not Safety! Benchmarking Latent Safety Risks of LLM-Driven Content Humorization

Yu Cui, Ruiqing Yue, Tingyu Li, Sicheng Pan +5 more

This paper explores safety risks in humorization of large language models (LLMs) and introduces HumorSafe framework for evaluating latent safety risks.

View →

cs.ARcs.DCEmpiricalRecent