Image Caption
Image captioning aims to automatically generate descriptive text for images, bridging the gap between computer vision and natural language processing. Current research emphasizes improving caption quality, accuracy, and diversity, often focusing on advancements in transformer-based models and contrastive learning approaches, as well as addressing biases and limitations in training data through techniques like data augmentation and deduplication. This field is crucial for enhancing accessibility of visual information, improving cross-modal retrieval systems, and advancing the understanding of human-computer interaction and multimodal learning.
Papers
August 16, 2023
July 17, 2023
June 13, 2023
June 5, 2023
May 24, 2023
May 11, 2023
May 9, 2023
May 4, 2023
May 3, 2023
April 26, 2023
April 4, 2023
March 26, 2023
March 13, 2023
March 12, 2023
February 8, 2023
February 7, 2023
January 26, 2023
December 4, 2022