Vision Foundation Model
Vision foundation models (VFMs) are large-scale, pre-trained models that learn robust visual representations applicable across diverse downstream tasks, reducing the need for extensive task-specific training data. Current research focuses on improving VFM efficiency and generalization through techniques such as continual learning, semi-supervised fine-tuning, and knowledge distillation. This work often builds on transformer-based architectures such as Vision Transformers (ViTs) and adapts them to specific applications, including medical image analysis and autonomous driving. Because a single pre-trained model can be adapted to many tasks with comparatively little labeled data, VFMs could accelerate progress across fields and enable more robust and adaptable AI systems.
Papers
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Zhixiang Wei, Lin Chen, Yi Jin, Xiaoxiao Ma, Tianle Liu, Pengyang Ling, Ben Wang, Huaian Chen, Jinjin Zheng
Fine-tuning vision foundation model for crack segmentation in civil infrastructures
Kang Ge, Chen Wang, Yutao Guo, Yansong Tang, Zhenzhong Hu, Hongbing Chen
Evaluating General Purpose Vision Foundation Models for Medical Image Analysis: An Experimental Study of DINOv2 on Radiology Benchmarks
Mohammed Baharoon, Waseem Qureshi, Jiahong Ouyang, Yanwu Xu, Abdulrhman Aljouie, Wei Peng
Bootstrapping SparseFormers from Vision Foundation Models
Ziteng Gao, Zhan Tong, Kevin Qinghong Lin, Joya Chen, Mike Zheng Shou
BioCLIP: A Vision Foundation Model for the Tree of Life
Samuel Stevens, Jiaman Wu, Matthew J Thompson, Elizabeth G Campolongo, Chan Hee Song, David Edward Carlyn, Li Dong, Wasila M Dahdul, Charles Stewart, Tanya Berger-Wolf, Wei-Lun Chao, Yu Su
Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models
Raviteja Vemulapalli, Hadi Pouransari, Fartash Faghri, Sachin Mehta, Mehrdad Farajtabar, Mohammad Rastegari, Oncel Tuzel