Huadong Ma
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
VidPrism introduces a novel heterogeneous Mixture-of-Experts framework that specializes temporal processing by dividing labor among experts, achieving state-of-the-art performance in image-to-video transfer.
The paper proposes a question-aware evidence ledger pipeline that significantly improves video relational reasoning by explicitly guiding the model to extract necessary evidence for complex spatial, temporal, and dialogue inferences.
Papers
Question-Aware Evidence Ledgers for Video Relational Reasoning
The paper proposes a question-aware evidence ledger pipeline that significantly improves video relational reasoning by explicitly guiding the model to extract necessary evidence for complex spatial, t…