Large Scale Vision Model
Large-scale vision models (LVMs) are deep learning systems designed for robust and versatile visual processing across diverse tasks, aiming to match or exceed human capabilities in image understanding. Current research emphasizes improving LVM robustness to distribution shifts and adversarial attacks, developing efficient training and compression techniques (like knowledge distillation and autoregressive methods), and exploring their application in areas such as robotics and medical imaging. These advancements are significant because they enable the deployment of powerful vision systems in resource-constrained environments and unlock new possibilities in various fields, from autonomous systems to creative applications.
Papers
September 9, 2024
July 17, 2024
July 5, 2024
June 17, 2024
April 23, 2024
March 30, 2024
March 18, 2024
February 7, 2024
January 16, 2024
November 27, 2023
April 24, 2023
March 1, 2023
December 20, 2022
May 29, 2022