Twenty Thousand Class
"Twenty-thousand class" research focuses on expanding the capabilities of computer vision models to recognize and segment a vastly increased number of object categories, significantly exceeding the limitations of traditional datasets. Current efforts involve integrating powerful segmentation models like SAM with zero-shot recognition models like CLIP, and developing novel algorithms like Invariant-Feature Subspace Recovery (ISR) to improve generalization across diverse visual domains. This research is crucial for advancing open-vocabulary visual recognition, enabling applications such as more robust object detection, improved image understanding, and more accurate scene interpretation in various fields.
Papers
March 4, 2024
January 5, 2024
November 2, 2023
May 22, 2023
April 28, 2023
April 10, 2023