Action Recognition
Action recognition, the task of automatically identifying actions within video data, aims to develop robust and efficient systems for understanding human and animal behavior. Current research focuses on improving accuracy and efficiency across diverse scenarios, employing various model architectures such as transformers, convolutional neural networks, and recurrent neural networks, often incorporating multimodal data (RGB, depth, skeleton, audio) and self-supervised learning techniques. This field is crucial for numerous applications, including autonomous systems, healthcare monitoring, and video surveillance, with ongoing efforts to address challenges like domain generalization, few-shot learning, and adversarial robustness.
Papers
PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition
Lucian Bicsi, Bogdan Alexe, Radu Tudor Ionescu, Marius Leordeanu
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning
Qiang Wang, Junlong Du, Ke Yan, Shouhong Ding
Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition
Yuhang Wen, Zixuan Tang, Yunsheng Pang, Beichen Ding, Mengyuan Liu
One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching
Siyuan Yang, Jun Liu, Shijian Lu, Er Meng Hwa, Alex C. Kot
A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023
Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan Kankanhalli
Free-Form Composition Networks for Egocentric Action Recognition
Haoran Wang, Qinghua Cheng, Baosheng Yu, Yibing Zhan, Dapeng Tao, Liang Ding, Haibin Ling