Xiongwei Han
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces RedundancyBench, a new benchmark for detecting unnecessary steps in LLM agent trajectories, finding that this task is highly complex and difficult to solve.
The paper introduces Opt-Verifier, a novel LLM-based framework that significantly improves the accuracy of automated optimization model generation by implementing dual-side verification from both structural and solution perspectives.
Papers
Redundant or Necessary? A Benchmark for Detecting Redundant Steps in Agent Trajectories
Minyang Hu, Bo Yang, Zhinuo Zhou, Jiachen Liang +3 more
The paper introduces RedundancyBench, a new benchmark for detecting unnecessary steps in LLM agent trajectories, finding that this task is highly complex and difficult to solve.