Open Vocabulary Instance Segmentation
Open-vocabulary instance segmentation aims to automatically identify and delineate objects in images and videos, even those not seen during model training, going beyond the limitations of traditional closed-vocabulary methods. Current research focuses on integrating 2D and 3D data streams, leveraging vision-language models and diffusion techniques to improve accuracy and handle diverse object appearances, including challenging scenarios like camouflage. These advancements are significant for broader applications in scene understanding, robotic perception, and augmented reality, reducing the reliance on extensive manual annotation for new object categories.
Papers
August 16, 2024
July 1, 2024
January 30, 2024
December 29, 2023
December 17, 2023
November 24, 2023
September 22, 2023
September 1, 2023
May 26, 2023
March 29, 2023
January 2, 2023
November 24, 2021
November 4, 2021