Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:
ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Home/Authors/Yu Lin

Yu Lin

3 indexed papers

Recent (6 mo)
3
With code
0
Influential cites
0
Benchmarked
0

Publications per year

3
26

Top categories

NLP×2AI×2Software Eng.×1

Frequent co-authors

Hongyu Lin2×
Xianpei Han2×
Le Sun2×
Yaojie Lu2×
Yanjiang Liu1×
Jie Lou1×

Research Timeline

2026
Data-Efficient On-Policy Distillation for Automatic Speech Recognition

The paper demonstrates that using on-policy distillation from a strong teacher model significantly improves the performance of compact Automatic Speech Recognition (ASR) models, achieving competitive results with a much smaller audio dataset compared to supervised fine-tuning.

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

The paper introduces Lookahead Group Reward (&) to combat Supervision Fidelity Decay (SFD) in on-policy distillation, significantly improving student model performance on long reasoning tasks.

Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination

The paper introduces Atomic Decomposition and Recombination (ADR), a novel framework that generates genuinely novel and challenging verifiable code tasks, significantly improving the scalability of Reinforcement Learning with Verifiable Rewards (RLVR) for LLMs.

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIRecentMay 29, 2026

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

Yanjiang Liu, Jie Lou, Xinyan Guan, Yuqiu Ji +6 more

The paper introduces Lookahead Group Reward (&) to combat Supervision Fidelity Decay (SFD) in on-policy distillation, significantly improving student model performance on long reasoning tasks.

View →
cs.CLcs.SERecentMay 29, 2026

Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination

Jiasheng Zheng, Boxi Cao, Boxi Yu, Yuzhong Zhang +5 more

The paper introduces Atomic Decomposition and Recombination (ADR), a novel framework that generates genuinely novel and challenging verifiable code tasks, significantly improving the scalability of Re…

View →
cs.AIRecentMay 27, 2026

Data-Efficient On-Policy Distillation for Automatic Speech Recognition

Yu Lin, Yiming Wang, Runyuan Cai, Xiaodong Zeng

The paper demonstrates that using on-policy distillation from a strong teacher model significantly improves the performance of compact Automatic Speech Recognition (ASR) models, achieving competitive…

View →