Action Recognition
Action recognition, the task of automatically identifying actions within video data, aims to develop robust and efficient systems for understanding human and animal behavior. Current research focuses on improving accuracy and efficiency across diverse scenarios, employing various model architectures such as transformers, convolutional neural networks, and recurrent neural networks, often incorporating multimodal data (RGB, depth, skeleton, audio) and self-supervised learning techniques. This field is crucial for numerous applications, including autonomous systems, healthcare monitoring, and video surveillance, with ongoing efforts to address challenges like domain generalization, few-shot learning, and adversarial robustness.
Papers
Distribution of Action Movements (DAM): A Descriptor for Human Action Recognition
Facundo Manuel Quiroga, Franco Ronchetti, Laura Lanzarini, Cesar Eestrebou
IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting
Tim J. Schoonbeek, Tim Houben, Hans Onvlee, Peter H. N. de With, Fons van der Sommen
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives
Karolina Seweryn, Anna Wróblewska, Szymon Łukasik
Elevating Skeleton-Based Action Recognition with Efficient Multi-Modality Self-Supervision
Yiping Wei, Kunyu Peng, Alina Roitberg, Jiaming Zhang, Junwei Zheng, Ruiping Liu, Yufan Chen, Kailun Yang, Rainer Stiefelhagen