Eugene Bagdasarian
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces 'contrastive privacy,' a formal, model-agnostic, and quantitative method for evaluating the semantic success of AI-based sanitization across multiple media modalities.
The paper argues that prompt injection is a fundamental vulnerability in AI agents, proposing that Contextual Integrity (CI) offers a principled framework to understand and mitigate context-sensitive failures, suggesting that current defenses are insufficient.
Papers
AI Agents May Always Fall for Prompt Injections
The paper argues that prompt injection is a fundamental vulnerability in AI agents, proposing that Contextual Integrity (CI) offers a principled framework to understand and mitigate context-sensitive…