Action Segmentation

Action segmentation aims to automatically divide a video into temporally contiguous segments, each corresponding to a single action. Current research relies heavily on transformer-based architectures, often pairing attention mechanisms with efficient feature encoding to improve accuracy and reduce computational cost, particularly on long videos. Applications range from video understanding and human-robot interaction to automated analysis of animal behavior and surgical procedures, which drives work on both algorithmic efficiency and new evaluation datasets.
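
To make the task definition concrete, here is a minimal sketch of how per-frame predictions are commonly collapsed into contiguous segments. It is an illustration only, not taken from any of the papers listed below; the function name `frames_to_segments` and the example labels are hypothetical.

```python
from itertools import groupby


def frames_to_segments(frame_labels):
    """Collapse per-frame labels into (label, start, end) segments.

    `start` is inclusive and `end` is exclusive, both in frame indices.
    """
    segments = []
    start = 0
    # groupby yields one group per run of identical consecutive labels
    for label, run in groupby(frame_labels):
        length = sum(1 for _ in run)
        segments.append((label, start, start + length))
        start += length
    return segments


# Hypothetical frame-wise predictions for a six-frame clip
labels = ["pour", "pour", "stir", "stir", "stir", "pour"]
print(frames_to_segments(labels))
```

The same action label can appear in more than one segment, as "pour" does here, because segments are defined by temporal contiguity rather than by label identity.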

Papers

February 3, 2022

Skeleton-Based Action Segmentation with Multi-Stage Spatial-Temporal Graph Convolutional Neural Networks
Benjamin Filtjens, Bart Vanrumste, Peter Slaets
Multi Stage Action Segmentation Spatial Temporal Graph Temporal Convolution Human Motion Analysis Skeleton Based Action Segmentation

January 14, 2022

Transformers in Action: Weakly Supervised Action Segmentation
John Ridley, Huseyin Coskun, David Joseph Tan, Nassir Navab, Federico Tombari
Transformer Weakly Supervised Action Feature Action Segmentation Action Duration Semi Supervised Temporal Action Segmentation

November 22, 2021

Towards Tokenized Human Dynamics Representation
Kenneth Li, Xiao Sun, Zhirong Wu, Fangyun Wei, Stephen Lin
Action Segmentation Action Understanding Frame Wise Representation
