Nesreen K. Ahmed
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
ML×1AI×1NLP×1Crypto×1
Frequent co-authors
Research Timeline
2026
Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning
The paper introduces Agent-ToM, a Theory-of-Mind (ToM) based framework that learns to monitor autonomous LLM agents by explicitly reasoning about their hidden beliefs and intentions to detect covert malicious behavior.
Highlighted terms show continued research focus across papers