Ruiying Du
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
Crypto×1AI×1
Frequent co-authors
Research Timeline
2026
Babel: Jailbreaking Safety Attention via Obfuscation Distribution Optimized Sampling
The paper introduces Babel, an efficient black-box attack framework that systematically exploits intrinsic safety gaps in LLMs by optimizing text obfuscation sampling, achieving state-of-the-art jailbreak success rates on commercial models.
Highlighted terms show continued research focus across papers
Papers
cs.CRcs.AIRecentMay 18, 2026
Babel: Jailbreaking Safety Attention via Obfuscation Distribution Optimized Sampling
Ziwei Wang, Jing Chen, Ruichao Liang, Zhi Wang +5 more
The paper introduces Babel, an efficient black-box attack framework that systematically exploits intrinsic safety gaps in LLMs by optimizing text obfuscation sampling, achieving state-of-the-art jailb…
View →