Text Feature
Text feature research focuses on effectively representing and integrating textual information with visual data, primarily aiming to improve the understanding and interaction between modalities. Current research emphasizes collaborative vision-text optimization within models like CLIP and Vision Transformers (ViTs), exploring techniques such as contrastive learning, hyperbolic embeddings, and prompt engineering to enhance alignment and robustness across diverse tasks. This work is significant for advancing applications like open-vocabulary segmentation, referring image segmentation, and text-guided image generation, impacting fields ranging from medical image analysis to remote sensing and person search.
Papers
October 21, 2024
August 19, 2024
August 1, 2024
June 3, 2024
May 23, 2024
April 12, 2024
January 25, 2024
December 2, 2023
November 13, 2023
September 21, 2023
August 22, 2023
March 24, 2023