Benchmarking Autonomous Agents against Temporal, Spatial, and Semantic Evasions