Yuxi Zhou

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×1

Frequent co-authors

Zhihao Liu1×

Yifan Wu1×

Jian Lou1×

Di Wang1×

Yuke Hu1×

Research Timeline

2026

Aligned but Fragile: Enhancing LLM Safety Robustness via Zeroth-Order Optimization

The paper proposes a novel zeroth-order optimization framework to enhance the robustness of LLM safety alignment, showing that few refinement steps can significantly improve safety while maintaining utility.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentMay 28, 2026

Aligned but Fragile: Enhancing LLM Safety Robustness via Zeroth-Order Optimization

Zhihao Liu, Yifan Wu, Jian Lou, Di Wang +2 more

View →