Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Sibei Yang

Sibei Yang

1 indexed paper

Recent (6 mo)
1
With code
0
Influential cites
0
Benchmarked
0

Publications per year

1
26

Top categories

Crypto×1AI×1

Frequent co-authors

Yi Wang1×
Hongye Qiu1×
Yue Xu1×
Zhan Qin1×
Minlie Huang1×
Wenjie Wang1×

Research Timeline

2026
EVA: Editing for Versatile Alignment against Jailbreaks

The paper proposes EVA, a novel framework that uses direct model editing to surgically correct specific neurons responsible for jailbreaking vulnerabilities in LLMs and VLMs, achieving robust safety alignment without performance degradation.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIRecentMay 14, 2026

EVA: Editing for Versatile Alignment against Jailbreaks

Yi Wang, Hongye Qiu, Yue Xu, Sibei Yang +3 more

The paper proposes EVA, a novel framework that uses direct model editing to surgically correct specific neurons responsible for jailbreaking vulnerabilities in LLMs and VLMs, achieving robust safety a…

View →