Caption Generation
Image caption generation aims to automatically create textual descriptions of images, bridging the gap between visual and linguistic information. Current research emphasizes improving caption quality and diversity through advanced transformer-based architectures, often incorporating contextual information from the surrounding scene or external knowledge bases, and exploring techniques like reinforcement learning with human feedback to align generated captions with human preferences. This field is significant for its applications in various domains, including image retrieval, accessibility for visually impaired individuals, and automated content creation for social media and scientific publications.
Papers
November 12, 2024
October 9, 2024
August 14, 2024
August 13, 2024
July 30, 2024
July 24, 2024
July 16, 2024
June 28, 2024
June 11, 2024
April 9, 2024
March 26, 2024
March 13, 2024
March 11, 2024
March 6, 2024
January 3, 2024
December 25, 2023
December 3, 2023
December 1, 2023
November 27, 2023