Yili Shen
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
Crypto×1AI×1
Frequent co-authors
Research Timeline
2026
AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills
The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often execute unsafe side effects while completing the visible user task.
Highlighted terms show continued research focus across papers
Papers
cs.CRcs.AIRecentMay 13, 2026
AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills
Haomin Zhuang, Hanwen Xing, Yujun Zhou, Yuchen Ma +4 more
The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often exec…
View →