Kavli Affiliate: Li Xin Li | First 5 Authors: Weijun Zhuang, Qizhang Li, Xin Li, Ming Liu, Xiaopeng Hong | Summary: Temporal Action Detection and Moment Retrieval constitute two pivotal tasks in video understanding, focusing on precisely localizing temporal segments corresponding to specific actions or events. Recent advancements introduced Moment Detection to unify these two […]
Continue.. Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection