Open Vocabulary Detection
Open-vocabulary detection (OVD) aims to enable object detection systems to identify objects beyond those explicitly included in their training data, addressing the limitations of traditional closed-vocabulary approaches. Current research heavily utilizes vision-language models (VLMs) like CLIP, often integrated into detector architectures such as DETR and YOLO, with a focus on improving feature alignment between visual and textual representations and mitigating noise from pseudo-labeling techniques. This field is significant because it pushes the boundaries of object recognition towards more robust and generalizable systems, with potential applications in various domains requiring real-world adaptability and handling of unseen objects.
Papers
October 8, 2024
September 13, 2024
July 31, 2024
July 16, 2024
July 10, 2024
May 28, 2024
April 14, 2024
March 26, 2024
February 7, 2024
January 30, 2024
January 4, 2024
October 20, 2023
September 29, 2023
September 1, 2023
August 25, 2023
July 18, 2023
May 11, 2023
April 25, 2023
April 22, 2023