Bing Hu
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper investigates multimodal jailbreak robustness across various reasoning paradigms and finds that explicit image-tool interaction significantly improves safety by guiding the model's internal representations toward safer directions.
The paper investigates multimodal jailbreak robustness across various reasoning paradigms and finds that explicit image-tool interaction significantly improves safety by shifting the model's internal representations toward a safety-relevant direction.
Papers
When Think-with-Image Meets Safety: What Determines Multimodal Jailbreak Robustness?
Yuan Tian, Bing Hu, Fang Wu, Xiaomin Li +2 more
The paper investigates multimodal jailbreak robustness across various reasoning paradigms and finds that explicit image-tool interaction significantly improves safety by guiding the model's internal r…