Yuan Xin
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
NLP×1Crypto×1
Frequent co-authors
Research Timeline
2026
SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts
The paper proposes SafeReview, a co-evolutionary adversarial training framework that significantly improves the robustness of LLM-based peer review systems against sophisticated adversarial hidden prompts.
Highlighted terms show continued research focus across papers
Papers
cs.CLcs.CRRecentApr 29, 2026
SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts
Yuan Xin, Yixuan Weng, Minjun Zhu, Ying Ling +4 more
The paper proposes SafeReview, a co-evolutionary adversarial training framework that significantly improves the robustness of LLM-based peer review systems against sophisticated adversarial hidden pro…
View →