Moment Retrieval

Moment retrieval locates the segment of a video, defined by its start and end times, that matches a natural-language query, which requires aligning visual and textual information. Recent work relies heavily on transformer-based architectures, whose attention mechanisms and multi-modal encoders are used to improve cross-modal alignment and to handle imprecise queries and noisy video backgrounds. The task matters for video understanding and has practical applications in video search, summarization, and content analysis; ongoing efforts aim to unify moment retrieval with related tasks such as temporal action detection.
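
To make the task's input and output concrete, here is a minimal sketch: a query embedding is compared with per-clip video embeddings, and the best-scoring contiguous run of clips is returned as a (start, end) span in seconds. Everything in it is an assumption made for illustration: the toy two-dimensional vectors stand in for the output of learned encoders, and the names `retrieve_moment`, `temporal_iou`, `clip_seconds`, and `threshold` are hypothetical. The thresholded maximum-subarray selection is a simple stand-in, not the method of any paper listed here. Temporal IoU is included as one common way to compare a predicted span with an annotated one.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve_moment(query_vec, clip_vecs, clip_seconds=2.0, threshold=0.5):
    """Return (start, end) in seconds for the best contiguous run of clips.

    Assumption: each clip covers `clip_seconds` of video, and clips whose
    similarity to the query falls below `threshold` count against a span.
    """
    # Shift scores so that weakly matching clips become negative.
    scores = [cosine(query_vec, c) - threshold for c in clip_vecs]

    # Kadane's maximum-subarray scan over the shifted scores.
    best_sum, best_span = float("-inf"), (0, 0)
    run_sum, run_start = 0.0, 0
    for i, s in enumerate(scores):
        if run_sum <= 0:
            run_sum, run_start = s, i  # start a new run at clip i
        else:
            run_sum += s               # extend the current run
        if run_sum > best_sum:
            best_sum, best_span = run_sum, (run_start, i + 1)

    start, end = best_span
    return start * clip_seconds, end * clip_seconds


def temporal_iou(pred, gt):
    """Intersection over union of two (start, end) spans in seconds."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


# Toy data: clips 2 and 3 point in the same direction as the query.
query = [1.0, 0.0]
clips = [[0.0, 1.0], [0.1, 1.0], [1.0, 0.0], [1.0, 0.1], [0.0, 1.0]]
predicted = retrieve_moment(query, clips)
overlap = temporal_iou(predicted, (4.0, 10.0))
```

With this toy data, `predicted` covers the two clips aligned with the query. Systems like those in the papers below learn the encoders and the span prediction jointly rather than thresholding similarities by hand.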

Papers

August 14, 2023

Knowing Where to Focus: Event-aware Transformer for Video Grounding
Jinhyun Jang, Jungin Park, Jin Kim, Hyeongjun Kwon, Kwanghoon Sohn
Moment Retrieval Video Grounding Event Transformer Moment Query

June 5, 2023

Background-aware Moment Detection for Video Moment Retrieval
Minjoon Jung, Youwon Jang, Seongho Choi, Joochan Kim, Jin-Hwa Kim, Byoung-Tak Zhang
Video Dataset Temporal Moment Video Moment Retrieval Moment Retrieval

May 30, 2023

MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction
Jing Wang, Aixin Sun, Hao Zhang, Xiaoli Li
Open Sampling DETR Training Moment Retrieval Lw Detr Natural Language Video Localization

May 23, 2023

Faster Video Moment Retrieval with Point-Level Supervision
Xun Jiang, Zailei Zhou, Xing Xu, Yang Yang, Guoqing Wang, Heng Tao Shen
Video Moment Retrieval Multimodal Alignment Moment Retrieval Point Supervision Temporal Annotation

May 2, 2023

TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis
Mathis Petrovich, Michael J. Black, Gül Varol
Motion Synthesis Human Motion Synthesis Moment Retrieval Motion Retrieval Affinity Loss

March 24, 2023

Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
WonJun Moon, Sangeek Hyun, SangUk Park, Dongchan Park, Jae-Pil Heo
Video Understanding Video Representation Video Moment Retrieval Video Question Moment Retrieval Highlight Detection

March 12, 2023

Towards Diverse Temporal Grounding under Single Positive Labels
Hao Zhou, Chongyang Zhang, Yanjun Chen, Chuanping Hu
Temporal Grounding Moment Retrieval Positive Label Conditional Moment Restriction Single Label Annotation

October 17, 2022

Selective Query-guided Debiasing for Video Corpus Moment Retrieval
Sunjae Yoon, Ji Woo Hong, Eunseop Yoon, Dahyun Kim, Junyeong Kim, Hee Suk Yoon, Chang D. Yoo
Retrieval Performance Self Debiasing Video Moment Retrieval Selective Engagement Model Debiasing Moment Retrieval Video Corpus Moment Retrieval

May 25, 2022

You Need to Read Again: Multi-granularity Perception Network for Moment Retrieval in Videos
Xin Sun, Xuan Wang, Jialin Gao, Qiong Liu, Xi Zhou
Cross Modal Gameplay Video Multi Granularity Moment Retrieval

March 23, 2022

UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection
Ye Liu, Siyuan Li, Yang Wu, Chang Wen Chen, Ying Shan, Xiaohu Qie
Unified Alignment Multi Modal Transformer Video Moment Retrieval Moment Retrieval Highlight Detection