He Zhu

2 indexed papers

Top categories

AI ×1, NLP ×1

Frequent co-authors

Qianqian Xie (2×)

Jiaheng Liu (2×)

Jiaming Wang (1×)

Ziteng Feng (1×)

Jiangtao Wu (1×)

Ruihao Li (1×)

Research Timeline

2026

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

The paper introduces TELBench and the DRIFT framework for fine-grained, span-level error localization in deep-research agents, improving the ability to pinpoint where in a trajectory an agent's reasoning fails.

TVIR: Building Deep Research Agents Towards Text-Visual Interleaved Report Generation

The paper introduces TVIR, a new benchmark and multi-agent framework for deep research that evaluates and improves the generation of factually reliable, text-visual interleaved reports.

Papers

cs.AI · Recent · Jun 1, 2026

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Jiaming Wang, Ziteng Feng, Jiangtao Wu, Ruihao Li +7 more

cs.CL · Recent · Jun 1, 2026