Image Captioning
Image captioning aims to automatically generate descriptive text for images, bridging the gap between computer vision and natural language processing. Current research focuses on improving efficiency (e.g., through early exits and knowledge distillation), enhancing performance on fine-grained datasets (e.g., by incorporating object-part details), and developing more robust evaluation metrics (e.g., addressing hallucinations). These advancements are significant for applications ranging from assisting visually impaired individuals to improving image search and retrieval, and are driving innovation in both vision-language models and evaluation methodologies.
Papers
March 24, 2024
March 23, 2024
March 21, 2024
March 20, 2024
March 12, 2024
March 10, 2024
March 4, 2024
February 28, 2024
February 27, 2024
February 21, 2024
February 19, 2024
February 13, 2024
February 9, 2024
February 8, 2024
February 7, 2024
February 5, 2024
February 1, 2024
January 16, 2024
January 10, 2024