Pinar Yanardag
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
VideoMLA introduces a novel Multi-Head Latent Attention (MLA) mechanism that replaces per-head KV caches with a shared low-rank content latent, significantly reducing memory and improving throughput for autoregressive video diffusion.
The paper introduces SPAWN, a training-free method that allows users to inject specified visual concepts into existing autoregressive world models, enabling controllable scene composition beyond the initial reference frame.
Papers
From Zero to Hero: Training-Free Custom Concept Spawning in World Models
The paper introduces SPAWN, a training-free method that allows users to inject specified visual concepts into existing autoregressive world models, enabling controllable scene composition beyond the i…