A Cross-Modal Prompt Injection Attack against Large Vision-Language Models with Image-Only Perturbation

Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual…

The paper introduces ImageProtector, a user-side method that embeds an impercept…

WebAgentGuard: A Reasoning-Driven Guard Model for Detecting Prompt Injection Attacks in Web Agents

The paper introduces WebAgentGuard, a novel reasoning-driven, multimodal guard m…

DeepSeek Robustness Against Semantic-Character Dual-Space Mutated Prompt Injection

The paper introduces PromptFuzz-SC, a novel semantic-character dual-space mutati…

PIArena: A Platform for Prompt Injection Evaluation

The paper introduces PIArena, a unified and extensible platform designed to addr…

AgentWatcher: A Rule-based Prompt Injection Monitor

AgentWatcher is a novel, rule-based monitor designed to detect prompt injection…

Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injecti…

The paper proposes a vision for system-level defenses against indirect prompt in…

Prompt Control-Flow Integrity: A Priority-Aware Runtime Defense Against Prompt Injection in LLM Syst…

The paper introduces Prompt Control-Flow Integrity (PCFI), a priority-aware runt…

Are AI-assisted Development Tools Immune to Prompt Injection?

The paper empirically analyzes the susceptibility of seven widely used AI-assist…