Image Captioning
Image captioning aims to automatically generate descriptive text for images, bridging the gap between computer vision and natural language processing. Current research focuses on improving efficiency (e.g., through early exits and knowledge distillation), enhancing performance on fine-grained datasets (e.g., by incorporating object-part details), and developing more robust evaluation metrics (e.g., addressing hallucinations). These advancements are significant for applications ranging from assisting visually impaired individuals to improving image search and retrieval, and are driving innovation in both vision-language models and evaluation methodologies.
Papers
March 7, 2022
March 3, 2022
February 28, 2022
February 21, 2022
February 14, 2022
February 11, 2022
January 31, 2022
January 30, 2022
January 23, 2022
January 6, 2022
January 4, 2022
December 13, 2021
December 2, 2021
November 29, 2021
November 24, 2021
November 23, 2021