Junyu Wang

1 indexed paper

Top categories

Crypto (1×)

Frequent co-authors

Yuanbo Zhou (1×)

Changjia Zhu (1×)

Xu He (1×)

Yan Zhai (1×)

Kun Sun (1×)

Mingkui Wei (1×)

Research Timeline

2026

Prompt Overflow: What the Guardrail Inspects Is Not What the Model Infers

The paper introduces the Prompt Overflow Attack and demonstrates that guardrail models that inspect truncated or segmented inputs fail to detect malicious instructions that become actionable only when the downstream LLM receives the full, overlong context.

Papers

cs.CR · Recent · May 22, 2026

Prompt Overflow: What the Guardrail Inspects Is Not What the Model Infers

Yuanbo Zhou, Changjia Zhu, Junyu Wang, Xu He +4 more
