Chao Pan
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces AgentRAE, a novel backdoor attack that successfully forces mobile GUI agents to execute remote actions using visually natural triggers found in system notifications, achieving high success rates while remaining difficult to detect.
The paper introduces SafeRedirect, a system-level defense that prevents frontier LLMs from generating harmful content during legitimate tasks that structurally require it, significantly reducing unsafe generation rates.
Papers
SafeRedirect: Defeating Internal Safety Collapse via Task-Completion Redirection in Frontier LLMs
The paper introduces SafeRedirect, a system-level defense that prevents frontier LLMs from generating harmful content during legitimate tasks that structurally require it, significantly reducing unsaf…