Image Captioning
Image captioning aims to automatically generate descriptive text for images, bridging the gap between computer vision and natural language processing. Current research focuses on improving efficiency (e.g., through early exits and knowledge distillation), enhancing performance on fine-grained datasets (e.g., by incorporating object-part details), and developing more robust evaluation metrics (e.g., addressing hallucinations). These advancements are significant for applications ranging from assisting visually impaired individuals to improving image search and retrieval, and are driving innovation in both vision-language models and evaluation methodologies.
Papers
October 18, 2022
October 10, 2022
October 7, 2022
October 4, 2022
September 30, 2022
September 17, 2022
September 16, 2022
September 12, 2022
September 3, 2022
August 17, 2022
August 13, 2022
August 8, 2022
July 26, 2022
July 22, 2022
July 15, 2022
July 12, 2022
July 7, 2022
July 4, 2022