Image Generation
Image generation research aims to create realistic and diverse images from inputs such as text, sketches, or other images, with an emphasis on control and efficiency. Current efforts center on refining diffusion and autoregressive models, using techniques such as dynamic computation, disentangled feature representations, and multimodal integration to improve image quality, controllability, and computational efficiency. These advances matter for accessible communication, creative content production, and computer vision tasks, and they supply tools for both scientific investigation and practical applications. Ongoing work addresses open challenges, including handling multiple conditioning signals, improving evaluation metrics, and mitigating biases and other limitations of existing models.
Papers
Text-guided Controllable Mesh Refinement for Interactive 3D Modeling
Yun-Chun Chen, Selena Ling, Zhiqin Chen, Vladimir G. Kim, Matheus Gadelha, Alec Jacobson
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Junhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang
$\Delta$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Pengtao Chen, Mingzhu Shen, Peng Ye, Jianjian Cao, Chongjun Tu, Christos-Savvas Bouganis, Yiren Zhao, Tao Chen
It's a Feature, Not a Bug: Measuring Creative Fluidity in Image Generators
Aditi Ramaswamy, Melane Navaratnarajah, Hana Chockler
Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline
Jan Lippemeier, Stefanie Hittmeyer, Oliver Niehörster, Markus Lange-Hegermann
Layout Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli, Jindong Jiang, Di Liu, Licheng Yu, Xiaoliang Dai, Ankit Ramchandani, Guan Pang, Dimitris N. Metaxas, Praveen Krishnan
Expert-Guided Extinction of Toxic Tokens for Debiased Generation
Xueyao Sun, Kaize Shi, Haoran Tang, Guandong Xu, Qing Li
Going beyond Compositions, DDPMs Can Produce Zero-Shot Interpolations
Justin Deschenaux, Igor Krawczuk, Grigorios Chrysos, Volkan Cevher
Patch-enhanced Mask Encoder Prompt Image Generation
Shusong Xu, Peiye Liu
Inpaint Biases: A Pathway to Accurate and Unbiased Image Generation
Jiyoon Myung, Jihyeon Park
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin, Michaël Gharbi, Taesung Park, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman
Conditional Diffusion on Web-Scale Image Pairs leads to Diverse Image Variations
Manoj Kumar, Neil Houlsby, Emiel Hoogeboom
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Katherine Xu, Lingzhi Zhang, Jianbo Shi
OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance
Shuheng Ge, Haoyu Xing, Li Zhang, Xiangqian Wu