Hao Zheng

4 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×3ML×2Crypto×2NLP×1

Frequent co-authors

Taco Cohen1×

Benjamin Negrevergne1×

Research Timeline

2026

RLSpoofer: A Lightweight Evaluator for LLM Watermark Spoofing Resilience

The paper introduces RLSpoofer, a lightweight, black-box reinforcement learning attack that demonstrates the fragile resilience of current LLM watermarking schemes by achieving a high spoofing success rate with minimal training data.

Hallucination as Exploit: Evidence-Carrying Multimodal Agents

The paper introduces Evidence-Carrying Agents (ECA) to prevent multimodal agents from executing privileged actions based on unsupported or hallucinated perceptual claims, achieving near-zero unsafe execution rates.

Extrapolative Weight Averaging Reveals Correctness-Efficiency Frontiers in Code RL

The paper demonstrates that extrapolative weight averaging can effectively navigate and extend the correctness-efficiency frontier in code RL, leading to improved performance on complex programming tasks.

Dr-CiK: A Testbed for Foresight-Driven Agents

The paper introduces Dr-CiK, a new benchmark designed to evaluate agents' ability to proactively discover, filter, and utilize relevant external context for time series forecasting, demonstrating that current agents struggle significantly with this task.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.AIcs.CLRecentMay 27, 2026

Extrapolative Weight Averaging Reveals Correctness-Efficiency Frontiers in Code RL

Kunhao Zheng, Pierre Chambon, Juliette Decugis, Jonas Gehring +3 more

View →

cs.AIcs.LGRecentMay 27, 2026