Jun Zhou

7 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×6Crypto×5NLP×1

Frequent co-authors

Haomin Zhuang3×

Yujun Zhou3×

Yufei Han3×

Xiangliang Zhang3×

Zeli Su2×

Zhankai Xu2×

Research Timeline

2026

AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

The paper introduces AgentTrap, a dynamic benchmark that measures LLM agent susceptibility to malicious side effects embedded within seemingly benign third-party skills, finding that agents often execute unsafe side effects while completing the visible user task.

A First Measurement Study on Authentication Security in Real-World Remote MCP Servers

This study provides the first measurement of authentication security in real-world remote Model Context Protocol (MCP) servers, finding pervasive and critical authentication weaknesses, particularly in dynamic client registration.

AIRGuard: Guarding Agent Actions with Runtime Authority Control

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent agent attacks by enforcing step-level authorization over external side effects.

AIRGuard: Guarding Agent Actions with Runtime Authority Control

AIRGuard is a runtime authority control guard that operationalizes least privilege to prevent language agents from executing unauthorized side effects, significantly reducing attack success rates on agent-specific vulnerabilities.

Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation

The paper introduces Source-Grounded Semantic Reinforcement Learning (SG-SRL), a framework that leverages abundant source-language monolingual data to improve target-language generation in low-resource settings by providing cross-lingual semantic supervision.

The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF

The paper introduces DistractionIF, a benchmark showing that larger LLMs are paradoxically less robust to benign, instruction-like noise in reference text, suggesting reinforcement learning can restore this robustness.

MemSecBench: Tracking Agent Memory Poisoning from Persistence to Consequence and Repair

The paper introduces MemSecBench, a benchmark for evaluating the lifecycle security of agent memory systems against malicious semantics, and reports results across various configurations.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIEmpiricalRecentJul 29, 2026

MemSecBench: Tracking Agent Memory Poisoning from Persistence to Consequence and Repair

Xuanze Chen, Xukang Xie, Wentao Fu, Jiajun Zhou +2 more

The paper introduces MemSecBench, a benchmark for evaluating the lifecycle security of agent memory systems against malicious semantics, and reports results across various configurations.

View →

cs.CLcs.AIRecent