Next Active Object
Next Active Object (NAO) prediction focuses on anticipating the object a person will interact with next in egocentric videos, a crucial step in understanding human-object interactions. Current research employs transformer-based architectures, often incorporating multi-modal data and guided attention mechanisms to improve accuracy in predicting both the object's identity and its future location and the timing of the interaction. This research is significant for advancing computer vision and robotics, with applications ranging from improving human-robot interaction to creating more context-aware assistive technologies and enhancing virtual and augmented reality experiences.
Papers
October 22, 2024
July 8, 2024
March 20, 2024
February 1, 2024
October 25, 2023
September 6, 2023
August 16, 2023
July 3, 2023
May 25, 2023
May 22, 2023
February 28, 2023
February 13, 2023
September 25, 2022
September 12, 2022
March 3, 2022