Multimodal Network
Multimodal networks integrate information from diverse data sources (e.g., text, images, audio) to improve performance on complex tasks compared to single-modality approaches. Current research emphasizes developing robust architectures, such as those employing transformer networks, that handle missing modalities and efficiently fuse information from different sources, including through techniques like early and late fusion, and dynamic fusion strategies. This field is significant for advancing artificial intelligence, particularly in applications like emotion recognition, action recognition, and medical diagnosis, where integrating multiple data types can lead to more accurate and reliable results.
Papers
December 23, 2024
December 19, 2024
November 12, 2024
November 5, 2024
October 18, 2024
September 13, 2024
August 14, 2024
August 10, 2024
August 5, 2024
June 27, 2024
June 20, 2024
May 16, 2024
May 14, 2024
March 27, 2024
March 12, 2024
January 16, 2024
December 17, 2023
December 1, 2023
October 6, 2023