Qiuchi Xiang
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper develops a novel attack method for multi-agent discussions under continuous monitoring, demonstrating that monitoring alone is insufficient to secure these systems.
This paper introduces the 'wide-net-casting' jailbreak scenario, demonstrating that querying a group of large language models can expose significant, previously overlooked safety risks, with a novel method achieving 100% jailbreak success in some tests.
Papers
New Wide-Net-Casting Jailbreak Attacks Risk Large Models
This paper introduces the 'wide-net-casting' jailbreak scenario, demonstrating that querying a group of large language models can expose significant, previously overlooked safety risks, with a novel m…