Modality Fusion
Modality fusion aims to combine information from multiple data sources (e.g., images, text, audio) to improve the performance and robustness of machine learning models. Current research focuses on developing effective fusion strategies, often employing transformer networks, graph neural networks, or state space models, and exploring optimal fusion points within model architectures to address issues like modality misalignment and incomplete data. This field is significant because it enables more comprehensive and accurate analysis of complex data, with applications ranging from improved medical diagnosis and plant identification to enhanced human-computer interaction and video understanding.
Papers
December 10, 2024
October 11, 2024
July 17, 2024
June 25, 2024
June 4, 2024
June 3, 2024
May 28, 2024
March 30, 2024
March 27, 2024
March 7, 2024
February 8, 2024
January 30, 2024
November 3, 2023
October 19, 2023
September 15, 2023
July 31, 2023
June 15, 2023
June 1, 2023
December 29, 2022