Video Representation Learning
Video representation learning aims to automatically extract meaningful features from video data, enabling computers to understand and analyze visual information in sequences. Current research heavily emphasizes self-supervised learning methods, often employing transformer-based architectures or contrastive learning approaches, to overcome the limitations of expensive manual annotation. These advancements are improving performance across various downstream tasks, including action recognition, video retrieval, and scene understanding, with significant implications for applications like video surveillance, autonomous driving, and content-based video search.
Papers
December 14, 2024
July 19, 2024
May 21, 2024
May 10, 2024
January 9, 2024
December 13, 2023
November 27, 2023
November 22, 2023
November 17, 2023
November 15, 2023
September 18, 2023
July 17, 2023
June 9, 2023
March 31, 2023
January 2, 2023
December 28, 2022
December 21, 2022
December 8, 2022