Compositional Action Recognition
Compositional action recognition aims to enable machines to understand and classify actions as combinations of simpler actions or object interactions, mirroring human cognitive abilities. Current research focuses on developing models that can generalize to unseen action combinations, often employing techniques like component-to-composition learning, multimodal knowledge distillation, and spatio-temporal interaction modeling using attention mechanisms and relational networks. These advancements are significant because they improve the robustness and generalization capabilities of action recognition systems, paving the way for more versatile applications in areas like video understanding and human-computer interaction.
Papers
July 8, 2024
September 25, 2023
May 4, 2023
December 21, 2022
October 9, 2022
July 4, 2022