Visual Exploration

Visual exploration research focuses on how agents, both human and artificial, efficiently and effectively gather information from visual data. Current efforts concentrate on developing models that predict human gaze patterns during tasks like captioning, employing reinforcement learning and other techniques to optimize exploration strategies in robotics and virtual environments, and leveraging large language and vision models to improve understanding and reasoning from visual scenes. This field is crucial for advancing artificial intelligence, particularly in robotics and computer vision, and for gaining deeper insights into human visual perception and cognitive processes.

Papers