Temporal Action Detection
Temporal action detection (TAD) aims to classify the actions in untrimmed videos and localize each one's start and end time, a core task in video understanding. Current research relies heavily on transformer-based architectures, such as DETR-style detectors, and often focuses on improving temporal modeling through refined feature extraction, attention mechanisms modified to counter problems such as attention collapse, and additional contextual information (e.g., audio, interactions). These advances support applications including video summarization, autonomous systems, and ecological monitoring by enabling more accurate and efficient analysis of complex video data.
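To make "locating" an action concrete, the sketch below computes the temporal intersection over union (tIoU) between a predicted segment and a ground-truth segment, an overlap measure commonly used to decide whether a detection counts as correct. The `Segment` type, the function name `temporal_iou`, and the example timestamps are illustrative assumptions, not taken from any particular paper or library.

```python
from typing import Tuple

# Hypothetical representation of a detected action: (start_sec, end_sec).
Segment = Tuple[float, float]


def temporal_iou(a: Segment, b: Segment) -> float:
    """Return the intersection over union of two time intervals."""
    # Overlap is zero when the intervals do not intersect.
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    # Guard against degenerate zero-length intervals.
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    predicted = (2.0, 6.0)     # assumed model output, in seconds
    ground_truth = (4.0, 8.0)  # assumed annotation, in seconds
    print(f"tIoU = {temporal_iou(predicted, ground_truth):.3f}")
```

A detection is typically treated as a match when its tIoU with a ground-truth segment of the same class exceeds a chosen threshold; the threshold value itself varies by benchmark.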