Jun Zhu

3 indexed papers

Top categories

Crypto ×3, NLP ×1, ML ×1, AI ×1, Robotics ×1

Frequent co-authors

Yuan Xin (1×)

Yixuan Weng (1×)

Minjun Zhu (1×)

Ying Ling (1×)

Chengwei Qin (1×)

Michael Backes (1×)

Research Timeline

2026

TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches

This paper introduces TRAP, an adversarial attack in which physical patches hijack the Chain-of-Thought (CoT) reasoning process of Vision-Language-Action (VLA) models, forcing them to perform unintended actions.

Dummy-Aware Weighted Attack (DAWA): Breaking the Safe Sink in Dummy Class Defenses

The paper introduces Dummy-Aware Weighted Attack (DAWA), a novel evaluation method that significantly reduces the reported robustness of Dummy Classes-based defenses by simultaneously targeting both the true and dummy class labels.

SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts

The paper proposes SafeReview, a co-evolutionary adversarial training framework that significantly improves the robustness of LLM-based peer review systems against sophisticated adversarial hidden prompts.

Papers

cs.CL, cs.CR · Recent · Apr 29, 2026

SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts

Yuan Xin, Yixuan Weng, Minjun Zhu, Ying Ling +4 more

cs.LG, cs.CR · Recent · Mar 31, 2026