Video Task
Video task research focuses on developing computational methods to analyze and understand video content, encompassing diverse objectives like action recognition, video question answering, and object tracking. Current efforts concentrate on leveraging large language models (LLMs) combined with visual feature extraction techniques, often employing transformer-based architectures and self-supervised learning strategies to improve efficiency and accuracy. This field is significant for its potential to advance various applications, including medical diagnosis (e.g., Parkinson's detection), autonomous driving, and improved accessibility to educational resources through instructional video comprehension.
Papers
June 26, 2024
June 21, 2024
June 14, 2024
March 25, 2024
December 21, 2023
December 18, 2023
December 1, 2023
November 20, 2023
October 24, 2023
October 8, 2023
September 25, 2023
September 8, 2023
August 21, 2023
August 8, 2023
July 26, 2023
March 30, 2023
March 8, 2023
December 15, 2022
December 13, 2022