Richer RGB Infrared Feature
Richer RGB-infrared feature integration aims to combine the complementary information from visible and infrared images for improved scene understanding and downstream tasks like object detection and tracking. Current research focuses on developing novel deep learning architectures, including variations of UNets, transformers, and generative adversarial networks (GANs), often incorporating attention mechanisms and multi-scale feature fusion strategies to enhance both visual quality and information preservation. These advancements are significant for applications requiring robust vision in challenging conditions, such as low-light environments or fire scenarios, and contribute to the broader field of multimodal image fusion.
Papers
Efficient Deep Learning Models for Privacy-preserving People Counting on Low-resolution Infrared Arrays
Chen Xie, Francesco Daghero, Yukai Chen, Marco Castellano, Luca Gandolfi, Andrea Calimera, Enrico Macii, Massimo Poncino, Daniele Jahier Pagliari
Breaking Modality Disparity: Harmonized Representation for Infrared and Visible Image Registration
Zhiying Jiang, Zengxi Zhang, Jinyuan Liu, Xin Fan, Risheng Liu