Liu Qing
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Research Timeline
2026
AlphaToken: Decoupling Adaptation and Stability for Path-Aware Response Token Valuation in LLM Post-Training
AlphaToken is a novel response token valuation framework that improves LLM post-training by decoupling token selection into task-specific adaptation and stability preservation, leading to better performance and reduced catastrophic forgetting.
Highlighted terms show continued research focus across papers