Zhibo Zhang

1 indexed paper

Top categories

Crypto (×1)

Frequent co-authors

Jiaqing Li (1×)

Shide Zhou (1×)

Yuxi Li (1×)

Tianlong Yu (1×)

Kailong Wang (1×)

Research Timeline

2026

When Safe Models Merge into Danger: Exploiting Latent Vulnerabilities in LLM Fusion

The paper introduces TrojanMerge, a framework demonstrating that model merging can be exploited to systematically compromise the safety alignment of multiple individually safe LLMs.

Papers

cs.CR · Recent · Apr 1, 2026

When Safe Models Merge into Danger: Exploiting Latent Vulnerabilities in LLM Fusion

Jiaqing Li, Zhibo Zhang, Shide Zhou, Yuxi Li +2 more
