Temporal Action Detection
Temporal action detection (TAD) aims to identify and locate actions within untrimmed videos, a crucial task for video understanding. Current research heavily utilizes transformer-based architectures, like DETR, often focusing on improving temporal modeling through techniques such as refined feature extraction, attention mechanism enhancements to address issues like attention collapse, and incorporating contextual information (e.g., audio, interactions). These advancements are driving progress in various applications, including video summarization, autonomous systems, and ecological monitoring, by enabling more accurate and efficient analysis of complex video data.
Papers
April 6, 2022
March 31, 2022
March 20, 2022
March 16, 2022
March 14, 2022
March 3, 2022
February 8, 2022
December 7, 2021