Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces | ArxivCSExplorer