Yingjie Feng
1 indexed paper
Recent (6 mo): 1
With code: 0
Influential cites: 0
Benchmarked: 0
Publications per year: 1 (2026)
Top categories
AI×1
Research Timeline
2026
Beyond Trajectory Rewards: Step-level Credit Assignment for Agentic Search via Graph Modeling
The paper introduces Graph-Distance Contribution Reward (GDCR) and Step Advantage Policy Optimization (SAPO) to provide fine-grained, step-level credit assignment for agentic search by modeling world knowledge as a latent graph.
Papers
cs.AI · Recent · May 28, 2026
Beyond Trajectory Rewards: Step-level Credit Assignment for Agentic Search via Graph Modeling
Yuchen Liu, Yingjie Feng, Lixiong Qin, Jiasi Chen +4 more