Real2SAM2Real: Generative 3D Caches as Complementary Context for Video Diffusion