Biaojie Zeng
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year
2026: 1
Top categories
Crypto ×1, AI ×1, NLP ×1, ML ×1, Multiagent ×1
Research Timeline
2026
SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces
The paper introduces SkillSafetyBench, a benchmark demonstrating that agent safety failures often stem from adversarial influences within reusable skills and execution environments, not only from malicious user prompts.
Papers
cs.CR, cs.AI, cs.CL · Recent · May 12, 2026
SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces
Chang Jin, An Wang, Zeming Wei, Kai Wang +6 more