Visual Textual Presentation

Visual textual presentation research focuses on automatically generating aesthetically pleasing and informative layouts that integrate text and visual elements, such as in posters or search engine results pages. Current efforts leverage large language models (LLMs) coupled with computer vision techniques, often employing generative adversarial networks (GANs) or other deep learning architectures to create these layouts, sometimes incorporating strategies like coarse-to-fine generation or design sequence formation to improve results. This field is significant for automating design processes, improving user experience in information retrieval, and advancing the capabilities of AI in creative tasks.

Papers