Online Video
Online video research focuses on bridging the semantic gap between video content and textual or other modalities, enabling improved retrieval, analysis, and generation. Current efforts concentrate on developing multimodal models, often employing transformer architectures and leveraging large-scale datasets from sources like YouTube, to achieve tasks such as video-to-music generation, enhanced text-video retrieval, and automatic content labeling. These advancements have significant implications for applications ranging from personalized content recommendation and improved search functionality to enabling more sophisticated robotic manipulation and facilitating the understanding of animal communication through video analysis.
Papers
November 17, 2024
October 11, 2024
October 10, 2024
September 11, 2024
August 14, 2024
June 12, 2024
May 15, 2024
May 2, 2024
March 21, 2024
March 3, 2024
February 16, 2024
September 22, 2023
September 21, 2023
December 8, 2022
November 23, 2022
November 15, 2022
May 25, 2022
May 10, 2022
March 30, 2022