Deep Vision Model
Deep vision models, primarily convolutional neural networks (CNNs) and vision transformers (ViTs), aim to enable computers to "see" and understand images and videos, achieving human-level performance in tasks like object recognition and video analysis. Current research heavily emphasizes improving model explainability, focusing on techniques like class activation maps and concept-based explanations to understand model decision-making processes and address the "black box" nature of deep learning. This work is crucial for building trust in these models, particularly in high-stakes applications like autonomous driving and medical image analysis, and for developing more robust and efficient architectures.
Papers
December 1, 2024
October 25, 2024
October 18, 2024
September 30, 2024
September 20, 2024
September 8, 2024
August 15, 2024
July 27, 2024
July 1, 2024
April 22, 2024
April 18, 2024
April 1, 2024
March 28, 2024
February 14, 2024
October 25, 2023
October 16, 2023
September 27, 2023
August 10, 2023
July 19, 2023