Visual Text
Visual text processing focuses on understanding and generating text within images, aiming to bridge the gap between computer vision and natural language processing. Current research emphasizes improving the accuracy and legibility of text generated by diffusion models, addressing challenges like misspelling and the generation of unsafe content through novel jailbreak techniques, and developing robust evaluation metrics for generalizability. This field is crucial for advancing applications like document understanding, image captioning, and text-to-image synthesis, impacting areas ranging from accessibility to content moderation.
Papers
October 24, 2024
October 6, 2024
September 27, 2024
September 23, 2024
September 4, 2024
August 19, 2024
June 5, 2024
June 1, 2024
March 28, 2024
March 25, 2024
February 5, 2024
January 29, 2024
December 21, 2023
October 18, 2023
October 9, 2023
August 7, 2023
May 29, 2023
May 11, 2023
March 29, 2023