LoopTrap: Termination Poisoning Attacks on LLM Agents

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

The paper introduces T-MAP, a trajectory-aware evolutionary search method, to di…

Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems

The paper introduces Document-Driven Implicit Payload Execution (DDIPE) to demon…

Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents

The paper introduces AutoMIA, a novel framework that uses LLM agents to automate…

The Autonomy Tax: Defense Training Breaks LLM Agents

Defense training for LLM agents, intended to improve safety, systematically degr…

Secure Forgetting: A Framework for Privacy-Driven Unlearning in Large Language Model (LLM)-Based Age…

The paper proposes a comprehensive framework for LLM-based agent unlearning, ena…

Evaluating Privilege Usage of Agents with Real-World Tools

The paper introduces GrantBox, a new security sandbox that evaluates how well LL…

Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web Agents

The paper introduces eTAMP, a novel attack that poisons LLM web agents' memory u…

Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance

This paper identifies and characterizes 'guidance injection,' a stealthy attack…