Monitoring Agentic Systems Before They're Reliable | ArxivCSExplorer