S-SPPO: Semantic-Calibrated Self-Play Preference Optimization | ArxivCSExplorer