Honghao Liu
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year: 1 (2026)
Top categories
Crypto ×1, AI ×1
Research Timeline
2026
Conflicts Make Large Reasoning Models Vulnerable to Attacks
The paper demonstrates that confronting Large Reasoning Models (LRMs) with conflicting objectives, such as contradictory choices or conflicting alignment values, significantly increases their vulnerability to harmful attacks.
Papers
cs.CR · cs.AI · Recent · Apr 10, 2026
Conflicts Make Large Reasoning Models Vulnerable to Attacks
Honghao Liu, Chengjin Xu, Xuhui Jiang, Cehao Yang +4 more