Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Yuanbo Zhou

Yuanbo Zhou

1 indexed paper

Recent (6 mo)
1
With code
0
Influential cites
0
Benchmarked
0

Publications per year

1
26

Top categories

Crypto×1

Frequent co-authors

Changjia Zhu1×
Junyu Wang1×
Xu He1×
Yan Zhai1×
Kun Sun1×
Mingkui Wei1×

Research Timeline

2026
Prompt Overflow: What the Guardrail Inspects Is Not What the Model Infers

The paper introduces the Prompt Overflow Attack, demonstrating that guardrail models inspecting truncated or segmented inputs fail to detect malicious instructions that are only actionable when the full, overlong context is provided to the downstream LLM.

Highlighted terms show continued research focus across papers

Papers

cs.CRRecentMay 22, 2026

Prompt Overflow: What the Guardrail Inspects Is Not What the Model Infers

Yuanbo Zhou, Changjia Zhu, Junyu Wang, Xu He +4 more

The paper introduces the Prompt Overflow Attack, demonstrating that guardrail models inspecting truncated or segmented inputs fail to detect malicious instructions that are only actionable when the fu…

View →