Ken Huang
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces LAAF, a novel automated red-teaming framework, to systematically test and exploit Logic-layer Prompt Control Injection (LPCI) vulnerabilities in complex agentic LLM systems.
The paper proves that no continuous, utility-preserving wrapper defense can make all inputs strictly safe for a language model with a connected prompt space, establishing a 'defense trilemma' among continuity, utility preservation, and completeness.
Papers
The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?
The paper proves that no continuous, utility-preserving wrapper defense can make all inputs strictly safe for a language model with a connected prompt space, establishing a 'defense trilemma' among co…