Object Level
Object-level understanding in computer vision aims to represent and reason about individual objects within scenes, moving beyond simple object detection to encompass their properties, relationships, and interactions. Current research heavily utilizes transformer-based architectures, often incorporating multi-modal learning (combining visual and textual data) and leveraging techniques like knowledge distillation and contrastive learning to improve model performance and generalization. This focus on object-centric representation is crucial for advancing applications such as autonomous driving, robotics, and image understanding, enabling more robust and context-aware systems.
Papers
December 1, 2023
November 21, 2023
November 18, 2023
October 30, 2023
October 18, 2023
July 9, 2023
May 24, 2023
May 3, 2023
April 9, 2023
April 7, 2023
March 27, 2023
March 10, 2023
February 3, 2023
January 31, 2023
December 26, 2022
December 6, 2022
December 1, 2022
November 21, 2022
November 9, 2022