Image to Text Task
Image-to-text tasks automatically generate textual descriptions from images, an area of artificial intelligence that bridges computer vision and natural language processing. Current research focuses on improving model accuracy and robustness, particularly with transformer-based architectures such as VL-BART and VL-T5, while addressing challenges such as adversarial attacks and the semantic alignment of generated text with image content. These advances matter for applications including social media analysis, content generation, and accessibility technologies, and they drive ongoing efforts to improve model efficiency and security.
Papers
October 2, 2024
September 14, 2023
August 3, 2023
June 13, 2023
May 17, 2023
January 5, 2023
October 20, 2022