Minglei Yang
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
Vision×1AI×1
Frequent co-authors
Research Timeline
2026
Mining Multi-Modality Spatio-Temporal Cues for Video Important Person Identification
The paper introduces VIP-Net, a framework that leverages multi-modal spatio-temporal cues and a new dataset (Temporal-VIP) to accurately identify the most influential people in videos, overcoming the challenge of Temporal Importance Shift (TIS).
Highlighted terms show continued research focus across papers
Papers
cs.CVcs.AIRecentMay 27, 2026
Mining Multi-Modality Spatio-Temporal Cues for Video Important Person Identification
Xiao Wang, Minglei Yang, Bin Yang, Wenke Huang +3 more
The paper introduces VIP-Net, a framework that leverages multi-modal spatio-temporal cues and a new dataset (Temporal-VIP) to accurately identify the most influential people in videos, overcoming the…
View →