Yihao Zhang

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×2AI×1NLP×1Vision×1ML×1

Frequent co-authors

Meng Sun2×

Xiaokun Luan1×

Pengcheng Su1×

Feiran Lei1×

Kai Wang1×

Jiangrong Wu1×

Research Timeline

2026

The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems

The paper introduces Salami Slicing Risk, a novel multi-turn jailbreak technique that accumulates harmful intent through numerous low-risk inputs, achieving state-of-the-art attack success rates against major LLMs.

VOW: Verifiable and Oblivious Watermark Detection for Large Language Models

VOW introduces a novel, privacy-preserving, and cryptographically verifiable protocol for detecting watermarks in LLM-generated text, overcoming the limitations of centralized and non-verifiable existing methods.

Highlighted terms show continued research focus across papers

Papers

cs.CRRecentApr 30, 2026

VOW: Verifiable and Oblivious Watermark Detection for Large Language Models

Xiaokun Luan, Yihao Zhang, Pengcheng Su, Feiran Lei +1 more

View →

cs.CRcs.AIcs.CLRecentApr 13, 2026