On the Generalization Gap in Self-Evolving Language Model Reasoning | ArxivCSExplorer