Visual Representation
Visual representation research focuses on creating effective ways for computers to understand and utilize visual information, primarily aiming to bridge the gap between raw image data and higher-level semantic understanding. Current research emphasizes developing robust and efficient visual representations through various techniques, including contrastive learning, masked image modeling, and the integration of vision models with large language models (LLMs), often employing transformer-based architectures. These advancements have significant implications for numerous applications, such as robotic control, medical image analysis, and improving the capabilities of multimodal AI systems.
Papers
July 14, 2023
July 6, 2023
June 6, 2023
June 2, 2023
June 1, 2023
May 29, 2023
May 26, 2023
May 23, 2023
May 22, 2023
May 21, 2023
May 11, 2023
May 10, 2023
April 28, 2023
April 21, 2023
April 17, 2023
April 13, 2023
April 10, 2023
April 7, 2023
March 22, 2023
March 21, 2023