Action Localization
Action localization in videos aims to identify both the class and temporal extent of actions within untrimmed video sequences. Current research emphasizes robust methods for handling multiple actions, noisy data, and limited annotations, often employing transformer-based architectures, multimodal approaches (combining visual and textual information), and self-supervised or weakly-supervised learning techniques to improve accuracy and efficiency. This field is crucial for applications ranging from video understanding and content analysis to robotics and assistive technologies, driving advancements in both model design and dataset creation.
Papers
October 18, 2024
October 8, 2024
September 22, 2024
September 4, 2024
August 26, 2024
August 25, 2024
July 9, 2024
June 20, 2024
June 13, 2024
May 4, 2024
April 19, 2024
April 8, 2024
April 2, 2024
March 12, 2024
March 11, 2024
December 29, 2023
November 13, 2023
August 19, 2023