Jerome Sieber
1 indexed paper
Recent (6 mo)
1 With code
0 Influential cites
0 Benchmarked
0 Publications per year
Top categories
ML ×1, AI ×1
Research Timeline
2026
A Predictive Law for On-Policy Self-Distillation From World Feedback
The paper identifies a linear predictive law linking the initial performance gap in on-policy self-distillation (OPSD) to the final performance improvement, allowing researchers to anticipate and tune OPSD outcomes before full training.
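As a rough illustration of what a linear predictive law of this kind could look like in practice, the sketch below fits a straight line relating an initial performance gap to a final improvement and then uses it to predict the gain for a new run. Everything here is an assumption made for illustration: the function names `fit_line` and `predict_gain` are hypothetical, the numbers are synthetic, and the paper's own definition of the gap, its fitting procedure, and its coefficients are not given in this summary.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct x values")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def predict_gain(initial_gap, slope, intercept):
    """Predict the final improvement from the initial gap using the fitted line."""
    return slope * initial_gap + intercept


# Synthetic, made-up measurements (NOT results from the paper):
# each pair is (initial performance gap, final improvement) for one run.
gaps = [0.05, 0.10, 0.20, 0.30]
gains = [0.02, 0.04, 0.08, 0.12]

slope, intercept = fit_line(gaps, gains)

# Estimate the improvement for a new run before training it in full.
predicted = predict_gain(0.25, slope, intercept)
print(f"slope={slope:.3f} intercept={intercept:.3f} predicted gain={predicted:.3f}")
```

If a relationship like this holds, a few cheap pilot runs would be enough to fit the line, and the initial gap of a new configuration could then be used to judge whether a full OPSD run is likely to be worthwhile.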