Action Detection
Action detection in videos focuses on identifying and precisely locating actions within video streams, addressing challenges like cluttered scenes and varying action durations. Current research emphasizes the development of robust and efficient models, often employing transformer architectures and incorporating multi-modal data (RGB, depth, audio, skeleton data) to improve accuracy and handle diverse action types. This field is crucial for various applications, including sports analytics, educational research, and surveillance systems, driving advancements in video understanding and enabling the development of AI-driven tools for diverse sectors.
Papers
TIM: A Time Interval Machine for Audio-Visual Action Recognition
Jacob Chalk, Jaesung Huh, Evangelos Kazakos, Andrew Zisserman, Dima Damen
T-DEED: Temporal-Discriminability Enhancer Encoder-Decoder for Precise Event Spotting in Sports Videos
Artur Xarles, Sergio Escalera, Thomas B. Moeslund, Albert Clapés