Visual Representation Learning
Visual representation learning aims to create effective numerical representations of images, enabling computers to "understand" and process visual information. Current research heavily focuses on self-supervised learning methods, leveraging architectures like Vision Transformers (ViTs) and convolutional neural networks (CNNs), often incorporating contrastive learning, masked image modeling, and techniques like prompt tuning to improve representation quality. These advancements are driving progress in diverse applications, including image classification, object detection, medical image analysis, and robotic manipulation, by providing more robust and generalizable visual features.
Papers
December 20, 2022
December 6, 2022
December 2, 2022
December 1, 2022
November 24, 2022
November 16, 2022
November 10, 2022
October 18, 2022
September 14, 2022
September 7, 2022
August 25, 2022
July 29, 2022
July 26, 2022
July 11, 2022
June 26, 2022
June 25, 2022
June 16, 2022
June 7, 2022
June 6, 2022
May 16, 2022