Video Representation Learning
Video representation learning aims to automatically extract meaningful features from video data, enabling computers to understand and analyze visual information in sequences. Current research heavily emphasizes self-supervised learning methods, often employing transformer-based architectures or contrastive learning approaches, to overcome the limitations of expensive manual annotation. These advancements are improving performance across various downstream tasks, including action recognition, video retrieval, and scene understanding, with significant implications for applications like video surveillance, autonomous driving, and content-based video search.
Papers
November 25, 2022
November 19, 2022
November 12, 2022
August 15, 2022
August 12, 2022
July 1, 2022
June 21, 2022
June 16, 2022
April 10, 2022
April 8, 2022
March 30, 2022
March 25, 2022
January 23, 2022
January 11, 2022
December 11, 2021
December 7, 2021