A Structured Benchmark for Text-Guided Anomaly Detection: When Language Stops Conditioning the Decision | ArxivCSExplorer