Yifei He

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×1

Frequent co-authors

Rui Yang1×

Hao Bai1×

Tong Zhang1×

Han Zhao1×

Research Timeline

2026

PRO-CUA: Process-Reward Optimization for Computer Use Agents

PRO-CUA introduces a process-reward optimization framework that enables efficient, step-level reinforcement learning for training computer use agents by decoupling environment interaction from policy optimization.

Highlighted terms show continued research focus across papers

Papers

cs.AIRecentMay 27, 2026

PRO-CUA: Process-Reward Optimization for Computer Use Agents

Yifei He, Rui Yang, Hao Bai, Tong Zhang +1 more

View →