Hoon Wei Lim
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
ARuleCon is an agentic framework that autonomously and accurately converts security rules across heterogeneous SIEM platforms, significantly outperforming baseline LLMs in fidelity.
This paper systematically analyzes the interaction of multiple weak jailbreak attacks (mutators) applied sequentially to LLMs, finding that most combinations fail due to destructive interference, revealing structural properties of model safety alignment.
Papers
Compositional Jailbreaking: An Empirical Analysis of Mutator Chain Interactions in Aligned LLMs
This paper systematically analyzes the interaction of multiple weak jailbreak attacks (mutators) applied sequentially to LLMs, finding that most combinations fail due to destructive interference, reve…