Zhibo Zhang
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
Crypto×1
Frequent co-authors
Research Timeline
2026
When Safe Models Merge into Danger: Exploiting Latent Vulnerabilities in LLM Fusion
The paper introduces TrojanMerge, a framework demonstrating that model merging can be exploited to systematically compromise the safety alignment of multiple individually safe LLMs.
Highlighted terms show continued research focus across papers
Papers
cs.CRRecentApr 1, 2026
When Safe Models Merge into Danger: Exploiting Latent Vulnerabilities in LLM Fusion
Jiaqing Li, Zhibo Zhang, Shide Zhou, Yuxi Li +2 more
The paper introduces TrojanMerge, a framework demonstrating that model merging can be exploited to systematically compromise the safety alignment of multiple individually safe LLMs.
View →