Yuan Wang

12 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×9Crypto×7Software Eng.×3NLP×2ML×2Info Retrieval×1Vision×1

Frequent co-authors

Su Wang3×

Pin Qian3×

Yihang Chen3×

Junxian You3×

Xiaoyuan Wang3×

Xiaochong Jiang2×

Research Timeline

2026

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

ClawKeeper is a comprehensive, multi-layered security framework designed to mitigate critical vulnerabilities in autonomous agent runtimes like OpenClaw by enforcing protection across skills, plugins, and system state.

Do Phone-Use Agents Respect Your Privacy?

The paper introduces MyPhoneBench, a new framework that demonstrates that current phone-use agents often fail to respect user privacy, even when successfully completing simple tasks, primarily due to unnecessary data disclosure.

How Code Representation Shapes False-Positive Dynamics in Cross-Language LLM Vulnerability Detection

The paper demonstrates that using raw source text for fine-tuning LLMs on vulnerability detection causes high false-positive rates by memorizing surface-level syntax, a problem mitigated by using Abstract Syntax Trees (ASTs) during inference.

Bridging the Detection-to-Abstention Gap in Reasoning Models under Insufficient Information

The paper addresses the 'detection-to-abstention gap' in reasoning models, where detecting insufficient information does not lead to abstention, by proposing a novel control framework that forces models to commit to an answerability judgment before solving.

Relevant Is Not Warranted: Evidence-Force Calibration for Cited RAG

The paper introduces FORCEBENCH, a new stress test designed to evaluate whether cited sources genuinely warrant the strength of a claim, revealing that standard citation evaluation methods often fail to detect over-strong claims.

Density-aware Sample-specific Attack

This paper proposes a density-aware attack that constructs triggers by placing poisoned samples in low-density regions of the clean data distribution, achieving high attack success rates even after strong post-training defenses.

DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval

DynaTree introduces a two-stage framework that pre-constructs a reusable retrieval tree offline using coordinated agents, allowing for efficient, structure-aware, and highly effective time-sensitive news retrieval online.

SpatialAct: Probing Spatial Reasoning-to-Action Capabilities of VLM Agents in 3D Scenes

The paper introduces SpatialAct, a challenging benchmark that reveals a significant 'reasoning-to-action gap,' showing that current VLMs struggle to maintain coherent spatial understanding and perform reliable actions in multi-turn 3D environments.

When Safe Skills Collide: Measuring Compositional Risk in Agent Skill Ecosystems

The paper introduces SkillReact, a framework that measures compositional risk in agent skill ecosystems, finding that even if individual skills are safe, their combination can create significant, exploitable security vulnerabilities.

When Safe Skills Collide: Measuring Compositional Risk in Agent Skill Ecosystems

Pepper: High-bandwidth and Scalable Anonymous Broadcast with Cryptographic Privacy

Pepper is a novel, high-bandwidth anonymous broadcast protocol that achieves cryptographic sender anonymity and significantly improves messaging throughput compared to existing state-of-the-art systems.

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

This paper proposes a training-free framework called ReasonAlloc to mitigate inference bottlenecks in large language models by recasting decoding-time key-value compression as a hierarchical budget allocation problem.

Highlighted terms show continued research focus across papers

Papers

cs.AIEmpiricalRecentJun 9, 2026

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

Wenhao Liu, Hao Shi, Yunhe Li, Weizhi Fei +6 more

View →

cs.CRRecentJun 3, 2026