Multi Label Image Recognition
Multi-label image recognition aims to automatically assign multiple labels to a single image, reflecting its diverse content. Current research heavily utilizes vision-language models (VLMs) and focuses on improving cross-modal alignment between visual and textual information, often employing techniques like prompt tuning and graph convolutional networks to capture complex label relationships and handle incomplete or noisy annotations. These advancements are significant for applications requiring fine-grained image understanding, such as object detection in complex scenes and large-scale image indexing, and are driving progress in efficient training methods for data-scarce scenarios.
Papers
July 30, 2024
July 26, 2024
March 2, 2024
January 31, 2024
October 27, 2023
August 28, 2023
August 3, 2023
July 15, 2023
April 21, 2023
November 27, 2022
November 23, 2022
November 15, 2022
May 26, 2022
May 23, 2022
May 11, 2022
April 22, 2022
April 8, 2022
March 10, 2022
March 4, 2022