Xianglin Yang

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×2Crypto×2Software Eng.×1ML×1

Frequent co-authors

Jin Song Dong2×

Xiaoyue Lu1×

Haijun Liu1×

Jiahao Liu1×

Kuntai Cai1×

Yan Xiao1×

Research Timeline

2026

Inverting the Shield: Systematically Generating Safety Tests from Policy Specifications

The paper introduces POLARIS, a novel framework that systematically generates comprehensive and verifiable safety tests for LLMs by formalizing natural language policies into First-Order Logic and exploring the resulting Semantic Policy Graph.

Turning Bias into Bugs: Bandit-Guided Style Manipulation Attacks on LLM Judges

The paper introduces BITE, a black-box adversarial framework that exploits stylistic biases in LLM judges by adaptively generating semantically equivalent edits to artificially inflate assigned scores.

Highlighted terms show continued research focus across papers

Papers

cs.AIcs.CRcs.SERecentMay 24, 2026

Inverting the Shield: Systematically Generating Safety Tests from Policy Specifications

Xiaoyue Lu, Xianglin Yang, Haijun Liu, Jiahao Liu +3 more

View →

cs.CRcs.AIcs.LGRecentMay 24, 2026