Video Recognition
Video recognition aims to automatically understand the content of videos, a complex task requiring the analysis of both spatial and temporal information. Current research focuses on improving efficiency and robustness, exploring architectures like transformers and convolutional neural networks, often incorporating techniques like masked autoencoders, attention mechanisms, and efficient positional encoding to handle the high dimensionality of video data. These advancements are crucial for applications ranging from autonomous driving and medical image analysis to content understanding and security, driving progress in both theoretical understanding and practical deployment of video analysis systems.
Papers
October 14, 2024
October 3, 2024
August 27, 2024
August 12, 2024
August 1, 2024
July 11, 2024
July 3, 2024
June 11, 2024
May 28, 2024
May 14, 2024
May 8, 2024
April 17, 2024
January 10, 2024
December 21, 2023
December 18, 2023
December 15, 2023
November 30, 2023
October 16, 2023