Video Recognition
Video recognition aims to automatically understand the content of videos, a complex task requiring the analysis of both spatial and temporal information. Current research focuses on improving efficiency and robustness, exploring architectures like transformers and convolutional neural networks, often incorporating techniques like masked autoencoders, attention mechanisms, and efficient positional encoding to handle the high dimensionality of video data. These advancements are crucial for applications ranging from autonomous driving and medical image analysis to content understanding and security, driving progress in both theoretical understanding and practical deployment of video analysis systems.
Papers
May 16, 2022
May 3, 2022
March 25, 2022
March 18, 2022
March 16, 2022
January 20, 2022
January 17, 2022
January 12, 2022
January 11, 2022
January 9, 2022
December 28, 2021
December 9, 2021
December 2, 2021