Neil Zhenqiang Gong
6 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper introduces ImageProtector, a user-side method that embeds an imperceptible perturbation into images to prevent Multi-modal Large Language Models (MLLMs) from analyzing and extracting sensitive information from them.
This paper systematically studies the robustness of vision foundation models to common image perturbations, finding that most models are generally non-robust and proposing a fine-tuning method to improve this resilience.
This study provides the first large-scale measurement of prompt injection attacks in real-world LLM-based resume screening, finding that approximately 1% of resumes contain hidden injections.
The paper investigates multimodal jailbreak robustness across various reasoning paradigms and finds that explicit image-tool interaction significantly improves safety by guiding the model's internal representations toward safer directions.
This study provides the first systematic measurement of prompt injection attacks in a real-world LLM-based resume screening application, finding that approximately 1% of resumes contain hidden injections.
The paper investigates multimodal jailbreak robustness across various reasoning paradigms and finds that explicit image-tool interaction significantly improves safety by shifting the model's internal representations toward a safety-relevant direction.
Papers
Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening
Mohan Zhang, Yuqi Jia, Zhen Tan, Steven Jiang +3 more
This study provides the first large-scale measurement of prompt injection attacks in real-world LLM-based resume screening, finding that approximately 1% of resumes contain hidden injections.