Different Modality
Multimodal learning focuses on integrating information from diverse data sources (e.g., text, images, audio) to improve model performance and robustness. Current research emphasizes efficient fusion techniques, addressing challenges like missing modalities through methods such as contrastive learning, modality-aware adaptation, and progressive alignment using lightweight architectures like OneEncoder. This field is significant for advancing AI capabilities in various applications, including medical diagnosis, visual question answering, and human activity recognition, by enabling more comprehensive and reliable analysis of complex data.
Papers
September 25, 2023
September 21, 2023
August 16, 2023
August 8, 2023
July 18, 2023
July 11, 2023
June 12, 2023
June 7, 2023
May 25, 2023
May 10, 2023
May 3, 2023
April 5, 2023
March 27, 2023
March 17, 2023
March 12, 2023
February 4, 2023
February 2, 2023
January 12, 2023
December 31, 2022