Robust Reasoning via Dynamic Token Selection for Distribution-Aligned Self-Distillation | ArxivCSExplorer