Yufei Han

3 indexed papers

Top categories

Crypto ×3, AI ×3

Frequent co-authors

Haomin Zhuang (3×)

Yujun Zhou (3×)

Xiangliang Zhang (3×)

Suliu Qin (2×)

Hanwen Xing (1×)

Yuchen Ma (1×)

Research Timeline

2026

AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often execute unsafe side effects while completing the visible user task.

AIRGuard: Guarding Agent Actions with Runtime Authority Control

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent agent attacks by enforcing step-level authorization over external side effects.

AIRGuard: Guarding Agent Actions with Runtime Authority Control

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent language agents from executing unauthorized side effects, significantly reducing attack success rates on agent-specific vulnerabilities.

Papers

cs.CR, cs.AI | Recent | May 27, 2026

AIRGuard: Guarding Agent Actions with Runtime Authority Control

Suliu Qin, Haomin Zhuang, Yujun Zhou, Yufei Han +1 more

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent agent attacks by enforcing step-level authorization over external side effects.

cs.CR, cs.AI | Recent | May 27, 2026