Xiangliang Zhang
3 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often execute unsafe side effects while completing the visible user task.
AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent agent attacks by enforcing step-level authorization over external side effects.
AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent language agents from executing unauthorized side effects, significantly reducing attack success rates on agent-specific vulnerabilities.
Papers
AIRGuard: Guarding Agent Actions with Runtime Authority Control
Suliu Qin, Haomin Zhuang, Yujun Zhou, Yufei Han +1 more
AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent agent attacks by enforcing step-level authorization over external side effects.