Video Search
Video search aims to efficiently retrieve relevant video clips based on textual or visual queries, a task complicated by the multimodal nature of video data. Current research focuses on improving cross-modal embedding techniques, often employing transformer-based architectures and generative models to better align textual queries with visual and audio features, addressing issues like the "modality gap" and vocabulary limitations. These advancements are crucial for enhancing the performance of video search engines and enabling more sophisticated applications like temporal grounding and interactive video retrieval, impacting fields ranging from advertising to archival research.
Papers
December 1, 2024
April 9, 2024
March 25, 2024
February 19, 2024
November 15, 2023
September 18, 2023
June 28, 2023
June 22, 2023
June 15, 2023
February 19, 2023
November 28, 2022
November 21, 2022
September 19, 2022
July 12, 2022