Yili Shen

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×1AI×1

Frequent co-authors

Haomin Zhuang1×

Hanwen Xing1×

Yujun Zhou1×

Yuchen Ma1×

Yue Huang1×

Yufei Han1×

Research Timeline

2026

AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often execute unsafe side effects while completing the visible user task.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIRecentMay 13, 2026

AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

Haomin Zhuang, Hanwen Xing, Yujun Zhou, Yuchen Ma +4 more

View →