When Think-with-Image Meets Safety: What Determines Multimodal Jailbreak Robustness? | ArxivCSExplorer