Honghao Liu

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×1AI×1

Frequent co-authors

Chengjin Xu1×

Xuhui Jiang1×

Cehao Yang1×

Shengming Yin1×

Zhengwu Ma1×

Lionel Ni1×

Research Timeline

2026

Conflicts Make Large Reasoning Models Vulnerable to Attacks

The paper demonstrates that confronting Large Reasoning Models (LRMs) with conflicting objectives, such as contradictory choices or conflicting alignment values, significantly increases their vulnerability to harmful attacks.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIRecentApr 10, 2026

Conflicts Make Large Reasoning Models Vulnerable to Attacks

Honghao Liu, Chengjin Xu, Xuhui Jiang, Cehao Yang +4 more

View →