CLIP Score
CLIP score is a metric used to evaluate the alignment between image and text embeddings generated by contrastive language-image pre-training (CLIP) models. Current research focuses on improving CLIP score's effectiveness for data selection in training larger visual-language models, mitigating biases like the over-reliance on textual cues within images, and adapting it for downstream tasks such as object counting and video quality assessment. These efforts aim to enhance the robustness and reliability of CLIP models, leading to improved performance in various applications including image retrieval, caption generation, and robotic perception.
Papers
October 15, 2024
October 10, 2024
October 9, 2024
September 23, 2024
July 8, 2024
May 29, 2024
May 2, 2024
April 3, 2024
January 24, 2024
December 21, 2023
September 24, 2023
July 13, 2023
May 12, 2023
April 13, 2023
December 15, 2022
October 17, 2022