David Yao

1 indexed paper

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

ML×1AI×1

Frequent co-authors

Hanyang Zhao1×

Haoxian Chen1×

Han Lin1×

Genta Indra Winata1×

Wenpin Tang1×

Research Timeline

2026

OPD+: Rethinking the Advantage Design for On-Policy Distillation

The paper introduces OPD+, a corrected on-policy distillation framework that mathematically proves the bias of standard stop-gradient methods and improves the stability and performance of knowledge transfer from teacher to student models.

Highlighted terms show continued research focus across papers

Papers

cs.LGcs.AIRecentMay 31, 2026

OPD+: Rethinking the Advantage Design for On-Policy Distillation

Hanyang Zhao, Haoxian Chen, Han Lin, Genta Indra Winata +2 more

View →