Moment Retrieval
Moment retrieval aims to pinpoint specific video segments matching a natural language query, bridging the gap between visual and textual information. Recent research heavily utilizes transformer-based architectures, often incorporating techniques like attention mechanisms and multi-modal encoders to improve cross-modal alignment and address challenges such as imprecise queries and noisy video backgrounds. This field is significant for advancing video understanding and has practical applications in video search, summarization, and content analysis, with ongoing efforts to unify moment retrieval with related tasks like temporal action detection.
Papers
December 30, 2024
December 18, 2024
December 2, 2024
November 28, 2024
November 21, 2024
October 2, 2024
August 14, 2024
July 9, 2024
June 26, 2024
June 10, 2024
May 21, 2024
April 7, 2024
March 4, 2024
March 3, 2024
February 21, 2024
December 11, 2023
November 30, 2023
November 28, 2023