Visual Attention
Visual attention research investigates how humans and animals selectively process visual information, aiming to understand the mechanisms underlying this crucial cognitive function and replicate it computationally. Current research focuses on developing models that integrate multiple sensory modalities (audio-visual), leverage object-level attention rather than pixel-level, and incorporate human gaze data for improved accuracy and interpretability, often employing transformer networks, spiking neural networks, and other deep learning architectures. These advancements have implications for various fields, including computer vision, human-computer interaction, and medical image analysis, by enabling more efficient and robust systems for tasks such as object tracking, speech recognition, and medical diagnosis.
Papers
A domain adaptive deep learning solution for scanpath prediction of paintings
Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Alessandro Bruno
A Spatial-channel-temporal-fused Attention for Spiking Neural Networks
Wuque Cai, Hongze Sun, Rui Liu, Yan Cui, Jun Wang, Yang Xia, Dezhong Yao, Daqing Guo
On Guiding Visual Attention with Language Specification
Suzanne Petryk, Lisa Dunlap, Keyan Nasseri, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach
Visual attention analysis of pathologists examining whole slide images of Prostate cancer
Souradeep Chakraborty, Ke Ma, Rajarsi Gupta, Beatrice Knudsen, Gregory J. Zelinsky, Joel H. Saltz, Dimitris Samaras