Xiaolong Yin
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year: 1 (2026)
Top categories
AI×1
Research Timeline
2026
OpenClawBench: Benchmarking Process-side Anomalies in Real-world Agent Execution Trajectories
The paper introduces OpenClawBench, a large-scale dataset and framework for measuring process-side anomalies in real-world agent execution trajectories, demonstrating that task success does not guarantee operational reliability.
Papers
cs.AI | Recent | May 28, 2026
OpenClawBench: Benchmarking Process-side Anomalies in Real-world Agent Execution Trajectories
Yibing Liu, Yangze Liu, Xiaolong Yin, Bin Wang +3 more
The paper introduces OpenClawBench, a large-scale dataset and framework for measuring process-side anomalies in real-world agent execution trajectories, demonstrating that task success does not guaran…