Action Localization
Action localization in videos aims to identify both the class and temporal extent of actions within untrimmed video sequences. Current research emphasizes robust methods for handling multiple actions, noisy data, and limited annotations, often employing transformer-based architectures, multimodal approaches (combining visual and textual information), and self-supervised or weakly-supervised learning techniques to improve accuracy and efficiency. This field is crucial for applications ranging from video understanding and content analysis to robotics and assistive technologies, driving advancements in both model design and dataset creation.
Papers
May 23, 2023
March 30, 2023
March 22, 2023
March 21, 2023
December 19, 2022
July 14, 2022
July 8, 2022
July 5, 2022
June 23, 2022
May 12, 2022
April 6, 2022
March 25, 2022
March 20, 2022
February 10, 2022
January 2, 2022
December 8, 2021
December 1, 2021
November 24, 2021
November 14, 2021