Different Modality
Multimodal learning focuses on integrating information from diverse data sources (e.g., text, images, audio) to improve model performance and robustness. Current research emphasizes efficient fusion techniques, addressing challenges like missing modalities through methods such as contrastive learning, modality-aware adaptation, and progressive alignment using lightweight architectures like OneEncoder. This field is significant for advancing AI capabilities in various applications, including medical diagnosis, visual question answering, and human activity recognition, by enabling more comprehensive and reliable analysis of complex data.
Papers
March 10, 2024
February 25, 2024
February 19, 2024
February 18, 2024
February 6, 2024
February 2, 2024
January 29, 2024
January 26, 2024
January 12, 2024
December 31, 2023
December 21, 2023
December 16, 2023
December 6, 2023
November 16, 2023
November 10, 2023
November 9, 2023
November 2, 2023
October 11, 2023