Different Modality
Multimodal learning focuses on integrating information from diverse data sources (e.g., text, images, audio) to improve model performance and robustness. Current research emphasizes efficient fusion techniques, addressing challenges like missing modalities through methods such as contrastive learning, modality-aware adaptation, and progressive alignment using lightweight architectures like OneEncoder. This field is significant for advancing AI capabilities in various applications, including medical diagnosis, visual question answering, and human activity recognition, by enabling more comprehensive and reliable analysis of complex data.
Papers
December 13, 2022
November 17, 2022
November 15, 2022
November 7, 2022
November 1, 2022
October 25, 2022
October 22, 2022
October 20, 2022
October 17, 2022
September 30, 2022
July 28, 2022
July 20, 2022
July 3, 2022
June 14, 2022
May 19, 2022
May 12, 2022
April 30, 2022
April 13, 2022
April 12, 2022
April 11, 2022