Layout to Image

Layout-to-image (L2I) generation aims to synthesize realistic images guided by predefined layouts, addressing the limitations of text-to-image models in precise spatial control. Current research focuses on improving the accuracy and fidelity of generated instances within these layouts, often employing diffusion models enhanced with modules like cross-attention mechanisms and adversarial training to refine object placement and features. This field is significant for advancing controllable image generation, impacting applications such as image editing, video generation, and data augmentation for computer vision tasks.

Papers

March 30, 2023

LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
Guangcong Zheng, Xianpan Zhou, Xuewei Li, Zhongang Qi, Ying Shan, Xi Li
Image Patch Layout to Image Controllable Diffusion Model

March 25, 2023

Freestyle Layout-to-Image Synthesis
Han Xue, Zhiwu Huang, Qianru Sun, Li Song, Wenjun Zhang
Text to Image Diffusion Model Semantic Mask Layout to Image Layout to Image Synthesis

February 23, 2023

Text Semantics to Image Generation: A method of building facades design base on Stable Diffusion model
Haoran Ma
Image Generation Practical Method Stable Diffusion Model Layout to Image Facade Image Text Semantics

February 16, 2023

LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation
Jiaxin Cheng, Xiao Liang, Xingjian Shi, Tong He, Tianjun Xiao, Mu Li
Generative Model Optimal Layout Layout to Image Semantic Layout

January 17, 2023

GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae Lee
Text to Image Diffusion Model Layout to Image Trainable Layer

June 2, 2022

Modeling Image Composition for Complex Scene Generation
Zuopeng Yang, Daqing Liu, Chaoyue Wang, Jie Yang, Dacheng Tao
Transformer Based Transformer Model Scene Generation Image Composition Layout to Image