Action Recognition
Action recognition, the task of automatically identifying actions within video data, aims to develop robust and efficient systems for understanding human and animal behavior. Current research focuses on improving accuracy and efficiency across diverse scenarios, employing various model architectures such as transformers, convolutional neural networks, and recurrent neural networks, often incorporating multimodal data (RGB, depth, skeleton, audio) and self-supervised learning techniques. This field is crucial for numerous applications, including autonomous systems, healthcare monitoring, and video surveillance, with ongoing efforts to address challenges like domain generalization, few-shot learning, and adversarial robustness.
Papers
EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond
Meiqi Cao, Xiangbo Shu, Jiachao Zhang, Rui Yan, Zechao Li, Jinhui Tang
An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition
Song-Jiang Lai, Tsun-Hin Cheung, Ka-Chun Fung, Tian-Shan Liu, Kin-Man Lam
AM Flow: Adapters for Temporal Processing in Action Recognition
Tanay Agrawal, Abid Ali, Antitza Dantcheva, Francois Bremond
ARN-LSTM: A Multi-Stream Fusion Model for Skeleton-based Action Recognition
Chuanchuan Wang, Ahmad Sufril Azlan Mohmamed, Mohd Halim Bin Mohd Noor, Xiao Yang, Feifan Yi, Xiang Li