Massive Spikes in LLMs are Bias Vectors: Mechanistic Uncovering and Spike-Free Quantization