Huan Sun

2 indexed papers

Top categories

NLP ×2, AI ×1

Frequent co-authors

Yu Su (2×)

Yiheng Shu (1×)

Bernal Jiménez Gutiérrez (1×)

Saisri Padmaja Jonnalagedda (1×)

Yuguang Yao (1×)

Yuting Ning (1×)

Research Timeline

2026

AGENTCL: Toward Rigorous Evaluation of Continual Learning in Language Agents

The paper introduces AGENTCL, a rigorous evaluation framework that uses controlled task streams to accurately measure an agent's ability to accumulate and reuse knowledge across multiple tasks, thereby addressing limitations in current continual learning benchmarks.

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

The paper introduces SkillHarm, a comprehensive benchmark and automated framework for evaluating skill-based attacks across the entire agent skill-use lifecycle, demonstrating that current agents remain highly vulnerable to both fixed-payload and self-mutating poisoning attacks.

Papers

cs.AI, cs.CL | Recent | Jun 1, 2026

AGENTCL: Toward Rigorous Evaluation of Continual Learning in Language Agents

Yiheng Shu, Bernal Jiménez Gutiérrez, Saisri Padmaja Jonnalagedda, Yuguang Yao +2 more

cs.CL | Recent | Jun 1, 2026