Image Captioning
Image captioning aims to automatically generate descriptive text for images, bridging the gap between computer vision and natural language processing. Current research focuses on improving efficiency (e.g., through early exits and knowledge distillation), enhancing performance on fine-grained datasets (e.g., by incorporating object-part details), and developing more robust evaluation metrics (e.g., addressing hallucinations). These advancements are significant for applications ranging from assisting visually impaired individuals to improving image search and retrieval, and are driving innovation in both vision-language models and evaluation methodologies.
Papers
June 16, 2022
June 14, 2022
June 7, 2022
June 3, 2022
May 28, 2022
May 26, 2022
May 25, 2022
May 24, 2022
May 9, 2022
May 4, 2022
April 28, 2022
April 27, 2022
April 15, 2022
April 8, 2022
March 29, 2022
March 28, 2022