Extreme Low-Bit Inference in Reasoning Models: Failure Modes and Targeted Recovery | ArxivCSExplorer