Zeming Wei

2 indexed papers

Top categories

Crypto ×2, AI ×2, NLP ×2, ML ×2, Multiagent ×1, Vision ×1

Frequent co-authors

Kai Wang (2×)

Chang Jin (1×)

An Wang (1×)

Biaojie Zeng (1×)

Qiaosheng Zhang (1×)

Chao Yang (1×)

Research Timeline

2026

The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems

The paper introduces Salami Slicing Risk, a novel multi-turn jailbreak technique that accumulates harmful intent through numerous low-risk inputs, achieving state-of-the-art attack success rates against major LLMs.

SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces

The paper introduces SkillSafetyBench, a comprehensive benchmark demonstrating that agent safety failures often stem from adversarial influences within reusable skills and execution environments, rather than just malicious user prompts.

Papers

cs.CR, cs.AI, cs.CL | Recent | May 12, 2026

SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces

Chang Jin, An Wang, Zeming Wei, Kai Wang +6 more

cs.CR, cs.AI, cs.CL | Recent | Apr 13, 2026