Action Inference

Action inference, the process of determining the actions that led to an observed state, is a crucial area of research with applications in robotics and human-computer interaction. Current work focuses on developing models, including transformers, graph neural networks, and vision transformers, to infer actions from various data modalities like point clouds, video, and multi-modal inputs, often leveraging techniques like trajectory stitching and generative models to improve accuracy and efficiency. These advancements are improving robot manipulation, human action recognition, and the understanding of implicit communication in human-robot interaction, ultimately leading to more robust and adaptable intelligent systems.

Papers