Text Image
Text-image research focuses on understanding and generating images containing text, aiming to improve the accuracy, realism, and diversity of such images. Current research heavily utilizes diffusion models, often enhanced with techniques like glyph-aware training and dual translation learning, to address challenges such as legible text generation, multi-concept synthesis, and cross-lingual capabilities. This field is significant for applications in combating misinformation (detecting text-image inconsistencies), improving scene text recognition, and enabling novel image editing and generation tasks, ultimately advancing both computer vision and natural language processing.
44papers
Papers - Page 2
March 25, 2024
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian YuanRefining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Sanyam Lakhanpal, Shivang Chopra, Vinija Jain, Aman Chadha, Man Luo
March 5, 2024
January 26, 2024
November 28, 2023
October 20, 2023
September 26, 2023
September 8, 2023