Action Detection
Action detection in videos focuses on identifying and precisely locating actions within video streams, addressing challenges like cluttered scenes and varying action durations. Current research emphasizes the development of robust and efficient models, often employing transformer architectures and incorporating multi-modal data (RGB, depth, audio, skeleton data) to improve accuracy and handle diverse action types. This field is crucial for various applications, including sports analytics, educational research, and surveillance systems, driving advancements in video understanding and enabling the development of AI-driven tools for diverse sectors.
Papers
Efficient Video Action Detection with Token Dropout and Context Refinement
Lei Chen, Zhan Tong, Yibing Song, Gangshan Wu, Limin Wang
ATTACH Dataset: Annotated Two-Handed Assembly Actions for Human Action Understanding
Dustin Aganian, Benedict Stephan, Markus Eisenbach, Corinna Stretz, Horst-Michael Gross