Qianqian Xu
2 indexed papers
Research Timeline
The paper proposes a training-free framework, Visual Representation-Guided Video-LLM Reasoning, that performs composed video retrieval from visual examples and text instructions and reports strong performance on the CVPR 2026 challenge.
The paper proposes an Understanding-Enhanced Model Collaboration Method (UE-MCM) to accurately detect subtle and rare mistakes in egocentric videos by combining coarse-grained workflow understanding with fine-grained action reasoning.
Papers
Training-Free Composed Video Retrieval via Visual Representation-Guided Video-LLM Reasoning
Yang Liu, Qianqian Xu, Peisong Wen, Siran Dai +1 more