Built by Teycir Ben Soltane
ArXivCSExplorer

Hao Zheng

4 indexed papers

Recent (6 mo): 4
With code: 0
Influential cites: 0
Benchmarked: 0

Publications per year

2026: 4

Top categories

AI ×3, ML ×2, Crypto ×2, NLP ×1

Frequent co-authors

Kunhao Zheng (1×)
Pierre Chambon (1×)
Juliette Decugis (1×)
Jonas Gehring (1×)
Taco Cohen (1×)
Benjamin Negrevergne (1×)

Research Timeline

2026
RLSpoofer: A Lightweight Evaluator for LLM Watermark Spoofing Resilience

The paper introduces RLSpoofer, a lightweight, black-box reinforcement learning attack that shows how fragile current LLM watermarking schemes are against spoofing, achieving a high spoofing success rate with minimal training data.

Hallucination as Exploit: Evidence-Carrying Multimodal Agents

The paper introduces Evidence-Carrying Agents (ECA) to prevent multimodal agents from executing privileged actions based on unsupported or hallucinated perceptual claims, achieving near-zero unsafe execution rates.

Extrapolative Weight Averaging Reveals Correctness-Efficiency Frontiers in Code RL

The paper demonstrates that extrapolative weight averaging can effectively navigate and extend the correctness-efficiency frontier in code RL, leading to improved performance on complex programming tasks.

Dr-CiK: A Testbed for Foresight-Driven Agents

The paper introduces Dr-CiK, a new benchmark designed to evaluate agents' ability to proactively discover, filter, and utilize relevant external context for time series forecasting, demonstrating that current agents struggle significantly with this task.


Papers

cs.LG, cs.AI, cs.CL | Recent | May 27, 2026

Extrapolative Weight Averaging Reveals Correctness-Efficiency Frontiers in Code RL

Kunhao Zheng, Pierre Chambon, Juliette Decugis, Jonas Gehring +3 more


cs.AI, cs.LG | Recent | May 27, 2026

Dr-CiK: A Testbed for Foresight-Driven Agents

Yihong Tang, Andrew Robert Williams, Arjun Ashok, Vincent Zhihao Zheng +5 more


cs.AI, cs.CR | Recent | May 18, 2026

Hallucination as Exploit: Evidence-Carrying Multimodal Agents

Guijia Zhang, Hao Zheng, Harry Yang


cs.CR | Recent | Apr 13, 2026

RLSpoofer: A Lightweight Evaluator for LLM Watermark Spoofing Resilience

Hanbo Huang, Xuan Gong, Yiran Zhang, Hao Zheng +1 more

