Captioning Datasets
Image captioning datasets are crucial for training and evaluating models that generate textual descriptions of images, aiming to bridge the gap between computer vision and natural language processing. Current research focuses on improving dataset quality by addressing noise and bias in existing datasets, developing more robust evaluation metrics, and exploring novel training strategies like self-supervised learning and contrastive methods, often employing transformer-based architectures. These advancements are vital for enhancing the accuracy and fluency of generated captions, with implications for applications ranging from image retrieval and accessibility tools to content creation and analysis across diverse domains.
Papers
February 23, 2023
January 26, 2023
November 14, 2022
October 10, 2022
September 21, 2022
August 18, 2022
July 15, 2022
May 7, 2022
May 3, 2022
April 10, 2022
April 1, 2022
March 12, 2022
December 9, 2021
November 24, 2021