Gaze Object

Gaze object prediction (GOP) identifies the object a person is looking at, a capability central to human-computer interaction and human-robot collaboration. Current research aims to improve accuracy and efficiency, notably by incorporating pixel-level supervision from vision foundation models and by developing unified, single-stage frameworks, such as transformer-based detectors, that localize the gaze and the object simultaneously. These advances support applications ranging from online exam proctoring to human-robot interaction, where understanding where a person is looking enables more natural and intuitive collaboration.

Papers