Visual Representation
Visual representation research focuses on creating effective ways for computers to understand and utilize visual information, primarily aiming to bridge the gap between raw image data and higher-level semantic understanding. Current research emphasizes developing robust and efficient visual representations through various techniques, including contrastive learning, masked image modeling, and the integration of vision models with large language models (LLMs), often employing transformer-based architectures. These advancements have significant implications for numerous applications, such as robotic control, medical image analysis, and improving the capabilities of multimodal AI systems.
Papers
April 18, 2022
April 15, 2022
April 2, 2022
March 29, 2022
March 23, 2022
March 17, 2022
March 16, 2022
March 15, 2022
March 11, 2022
February 28, 2022
February 22, 2022
February 18, 2022
February 14, 2022
February 11, 2022
February 10, 2022
January 30, 2022
January 20, 2022