Visual Model
Visual models are computational systems designed to process and understand visual information, aiming to replicate or surpass human-level visual perception and reasoning. Current research emphasizes improving model accuracy, interpretability, and robustness through techniques like dilated convolutions with learnable spacings, adaptive model selection based on context, and the integration of visual models with large language models for multimodal tasks. These advancements are significant for various applications, including video production, remote sensing, medical imaging, and autonomous navigation, driving progress in both scientific understanding and practical technological capabilities.
Papers
August 30, 2024
August 6, 2024
July 29, 2024
June 14, 2024
April 29, 2024
January 17, 2024
December 21, 2023
November 6, 2023
October 12, 2023
September 20, 2023
September 11, 2023
May 30, 2023
May 23, 2023
May 10, 2023
March 28, 2023
March 8, 2023
January 17, 2023
November 4, 2022
October 31, 2022