Ming Gong
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces the concept of policy-invisible violations in LLM agents and proposes Sentinel, a counterfactual graph simulation framework, which significantly improves policy enforcement accuracy by incorporating hidden world-state context.
The paper introduces Distributed Sentinel, a zero-trust architecture that prevents Context-Fragmented Violations (CFVs) in multi-agent systems by propagating security state across departmental boundaries.
Papers
Beyond Single-Agent Alignment: Preventing Context-Fragmented Violations in Multi-Agent Systems
The paper introduces Distributed Sentinel, a zero-trust architecture that prevents Context-Fragmented Violations (CFVs) in multi-agent systems by propagating security state across departmental boundar…