A Framework for Formalizing LLM Agent Security

Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injecti…

The paper proposes a vision for system-level defenses against indirect prompt in…

Agent Audit: A Security Analysis System for LLM Agent Applications

Agent Audit is a novel security analysis system that comprehensively audits LLM…

ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation

The paper introduces ClawTrap, a MITM-based red-teaming framework, to evaluate t…

AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

The paper proposes a unified closed-loop threat taxonomy to systematically analy…

Revisiting Vulnerability Patch Identification on Data in the Wild

The paper demonstrates that security patch detection models trained solely on pu…

ClawLess: A Security Model of AI Agents

ClawLess introduces a formally verified security framework that enforces fine-gr…

Evaluating Privilege Usage of Agents with Real-World Tools

The paper introduces GrantBox, a new security sandbox that evaluates how well LL…

Unveiling the Security Risks of Federated Learning in the Wild: From Research to Practice

This paper argues that much of the existing research on Federated Learning (FL)…