Image Caption
Image captioning aims to automatically generate descriptive text for images, bridging computer vision and natural language processing. Current research focuses on improving caption quality, accuracy, and diversity, largely through advances in transformer-based models and contrastive learning, and on mitigating biases and limitations in training data with techniques such as data augmentation and deduplication. The field is important for making visual information more accessible, improving cross-modal retrieval systems, and advancing research in human-computer interaction and multimodal learning.