Visual Representation Learning
Visual representation learning aims to encode images as numerical feature vectors that capture their content, so that computers can process and reason about visual information. Current research focuses heavily on self-supervised methods built on architectures such as Vision Transformers (ViTs) and convolutional neural networks (CNNs), often combining contrastive learning, masked image modeling, and adaptation techniques such as prompt tuning to improve representation quality. These advances drive progress in applications including image classification, object detection, medical image analysis, and robotic manipulation by providing more robust and generalizable visual features.
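As a rough illustration of the contrastive learning mentioned above, the following is a minimal sketch of an InfoNCE-style loss in plain Python. It is not taken from any of the listed papers. The function names (`l2_normalize`, `info_nce_loss`), the temperature value, and the toy two-dimensional "embeddings" are all assumptions made for the example. In practice the embeddings would come from a ViT or CNN encoder applied to two augmented views of each image.

```python
import math

def l2_normalize(v):
    # Scale a vector to unit length (assumes the vector is non-zero).
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def info_nce_loss(anchors, positives, temperature=0.1):
    """Mean InfoNCE-style loss over a batch.

    anchors[i] and positives[i] are embeddings of two views of the same
    image; every positives[j] with j != i acts as a negative for anchors[i].
    """
    a = [l2_normalize(v) for v in anchors]
    p = [l2_normalize(v) for v in positives]
    total = 0.0
    for i, ai in enumerate(a):
        # Cosine similarity of anchor i to every candidate, scaled by temperature.
        logits = [sum(x * y for x, y in zip(ai, pj)) / temperature for pj in p]
        # Numerically stable log-sum-exp over the candidates.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
        # Cross-entropy with the matching view as the correct "class".
        total += log_sum_exp - logits[i]
    return total / len(a)

# Hypothetical toy embeddings: two images, two views each.
view_a = [[1.0, 0.0], [0.0, 1.0]]
view_b = [[0.9, 0.1], [0.1, 0.9]]

aligned = info_nce_loss(view_a, view_b)         # views paired correctly
shuffled = info_nce_loss(view_a, view_b[::-1])  # views paired incorrectly
print(aligned < shuffled)
```

The loss is lower when matching views have similar embeddings and higher when they are mismatched, which is the signal an encoder would be trained on under this kind of objective.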
Papers
September 18, 2023
September 9, 2023
August 8, 2023
July 20, 2023
June 28, 2023
June 8, 2023
June 1, 2023
May 23, 2023
May 18, 2023
April 13, 2023
April 6, 2023
March 15, 2023
March 14, 2023
February 24, 2023
February 23, 2023
January 29, 2023
January 28, 2023
January 26, 2023
December 30, 2022
December 20, 2022