Cross Modal Interaction
Cross-modal interaction research focuses on effectively integrating information from different data modalities (e.g., text, images, audio) to improve the performance of AI systems. Current research emphasizes developing novel architectures, such as multimodal transformers and graph neural networks, and innovative training paradigms like cross-modal denoising and alternating unimodal adaptation, to achieve better cross-modal alignment and feature fusion. This field is significant because improved cross-modal understanding is crucial for advancing applications in diverse areas, including image segmentation, robotics, and medical diagnosis, by enabling AI systems to process and interpret richer, more nuanced information.
Papers
December 27, 2023
December 11, 2023
November 17, 2023
November 9, 2023
November 8, 2023
November 1, 2023
October 26, 2023
October 19, 2023
August 20, 2023
August 7, 2023
July 28, 2023
May 23, 2023
May 15, 2023
March 14, 2023
February 20, 2023
November 27, 2022
November 17, 2022
November 9, 2022
October 17, 2022