Moment Retrieval
Moment retrieval aims to pinpoint specific video segments matching a natural language query, bridging the gap between visual and textual information. Recent research heavily utilizes transformer-based architectures, often incorporating techniques like attention mechanisms and multi-modal encoders to improve cross-modal alignment and address challenges such as imprecise queries and noisy video backgrounds. This field is significant for advancing video understanding and has practical applications in video search, summarization, and content analysis, with ongoing efforts to unify moment retrieval with related tasks like temporal action detection.
Papers
October 2, 2024
August 14, 2024
July 9, 2024
June 26, 2024
June 10, 2024
May 21, 2024
April 7, 2024
March 4, 2024
March 3, 2024
February 21, 2024
December 11, 2023
November 30, 2023
November 28, 2023
August 29, 2023
August 14, 2023
June 5, 2023
May 30, 2023
May 23, 2023
May 2, 2023