Probing Collision Grounding in Vision-Language Models for Safe Human-Robot Collaboration | ArxivCSExplorer