Visual Search

Visual search, the process of finding a target object amidst distractors, is a fundamental aspect of human and animal vision, with research focusing on understanding the underlying cognitive mechanisms and developing computational models that mimic this ability. Current research emphasizes object-level attention, incorporating semantic information and contextual cues within transformer-based architectures and other deep learning models to improve accuracy and efficiency, particularly in complex scenes and across multiple modalities (e.g., combining visual and textual information). These advancements have implications for improving human-computer interaction, medical image analysis, and robotics, by enabling more efficient and accurate visual information processing in various applications.

Papers