Masked Image
Masked image modeling (MIM) is a self-supervised learning technique that trains computer vision models by reconstructing masked portions of images, leveraging unlabeled data to learn robust feature representations. Current research focuses on improving MIM's efficiency and effectiveness through architectural innovations like incorporating structured knowledge, interactive masking strategies, and multi-modal data fusion, often within transformer or convolutional neural network frameworks. This approach holds significant promise for advancing various computer vision tasks, particularly in domains with limited labeled data, such as medical image analysis and remote sensing, by enabling the pre-training of powerful models.
Papers
December 26, 2024
November 24, 2024
November 13, 2024
October 14, 2024
September 20, 2024
September 13, 2024
September 4, 2024
July 18, 2024
May 23, 2024
March 23, 2024
February 15, 2024
June 29, 2023
June 18, 2023
February 4, 2023
December 31, 2022
December 29, 2022