Saliency Map
Saliency maps are visual representations highlighting the most influential regions of an input (e.g., image, video, audio) for a model's prediction, aiming to improve the interpretability of "black box" models like deep neural networks. Current research focuses on developing more accurate and robust saliency map generation methods, often employing gradient-based techniques, transformer architectures, and diffusion models, and exploring their application across diverse data modalities (images, videos, audio, time series). These advancements are crucial for enhancing trust and understanding in AI systems, particularly in high-stakes applications like medical diagnosis and autonomous driving, by providing insights into model decision-making processes.
Papers
TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation
Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinness
VS-Net: Multiscale Spatiotemporal Features for Lightweight Video Salient Document Detection
Hemraj Singh, Mridula Verma, Ramalingaswamy Cheruku