Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Jun Zhou

Jun Zhou

6 indexed papers

Recent (6 mo)
6
With code
0
Influential cites
0
Benchmarked
0

Publications per year

6
26

Top categories

AI×5Crypto×4NLP×1

Frequent co-authors

Haomin Zhuang3×
Yujun Zhou3×
Yufei Han3×
Xiangliang Zhang3×
Zeli Su2×
Zhankai Xu2×

Research Timeline

2026
AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often execute unsafe side effects while completing the visible user task.

A First Measurement Study on Authentication Security in Real-World Remote MCP Servers

This study provides the first measurement of authentication security in real-world remote Model Context Protocol (MCP) servers, finding pervasive and critical authentication weaknesses, particularly in dynamic client registration.

AIRGuard: Guarding Agent Actions with Runtime Authority Control

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent agent attacks by enforcing step-level authorization over external side effects.

AIRGuard: Guarding Agent Actions with Runtime Authority Control

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent language agents from executing unauthorized side effects, significantly reducing attack success rates on agent-specific vulnerabilities.

Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation

The paper introduces Source-Grounded Semantic Reinforcement Learning (SG-SRL), a framework that leverages abundant source-language monolingual data to improve target-language generation in low-resource settings by providing cross-lingual semantic supervision.

The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF

The paper introduces DistractionIF, a benchmark showing that larger LLMs are paradoxically less robust to benign, instruction-like noise in reference text, suggesting reinforcement learning can restore this robustness.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIRecentMay 28, 2026

Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation

Zeli Su, Ziyin Zhang, Zewei Pan, Zhou Liu +7 more

The paper introduces Source-Grounded Semantic Reinforcement Learning (SG-SRL), a framework that leverages abundant source-language monolingual data to improve target-language generation in low-resourc…

View →
cs.AIRecentMay 28, 2026

The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF

Zeli Su, Zhankai Xu, Tianlei Chen, Longfei Zheng +3 more

The paper introduces DistractionIF, a benchmark showing that larger LLMs are paradoxically less robust to benign, instruction-like noise in reference text, suggesting reinforcement learning can restor…

View →
cs.CRcs.AIRecentMay 27, 2026

AIRGuard: Guarding Agent Actions with Runtime Authority Control

Suliu Qin, Haomin Zhuang, Yujun Zhou, Yufei Han +1 more

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent agent attacks by enforcing step-level authorization over external side effects.

View →
cs.CRcs.AIRecentMay 27, 2026

AIRGuard: Guarding Agent Actions with Runtime Authority Control

Suliu Qin, Haomin Zhuang, Yujun Zhou, Yufei Han +1 more

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent language agents from executing unauthorized side effects, significantly reducing attack success rates on a…

View →
cs.CRRecentMay 21, 2026

A First Measurement Study on Authentication Security in Real-World Remote MCP Servers

Huijun Zhou, Xiaohan Zhang, Haozhe Zhang, Haoyang Zhang +2 more

This study provides the first measurement of authentication security in real-world remote Model Context Protocol (MCP) servers, finding pervasive and critical authentication weaknesses, particularly i…

View →
cs.CRcs.AIRecentMay 13, 2026

AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

Haomin Zhuang, Hanwen Xing, Yujun Zhou, Yuchen Ma +4 more

The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often exec…

View →