Generative Model
Generative models are artificial intelligence systems designed to create new data instances that resemble a training dataset, aiming to learn and replicate the underlying data distribution. Current research emphasizes improving efficiency and controllability, focusing on architectures like diffusion models, autoregressive models, and generative flow networks, as well as refining training algorithms and loss functions. These advancements have significant implications across diverse fields, enabling applications such as realistic image and music generation, protein design, and improved data augmentation techniques for various machine learning tasks.
Papers
Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective
Neta Shaul, Itai Gat, Marton Havasi, Daniel Severo, Anuroop Sriram, Peter Holderrieth, Brian Karrer, Yaron Lipman, Ricky T. Q. Chen
Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation
Gianni Franchi, Dat Nguyen Trong, Nacim Belkhir, Guoxuan Xia, Andrea Pilzer
CLAS: A Machine Learning Enhanced Framework for Exploring Large 3D Design Datasets
XiuYu Zhang, Xiaolei Ye, Jui-Che Chang, Yue Fang
The effect of priors on Learning with Restricted Boltzmann Machines
Gianluca Manzan, Daniele Tantari
Unveiling Concept Attribution in Diffusion Models
Quang H. Nguyen, Hoang Phan, Khoa D. Doan
Fast LiDAR Data Generation with Rectified Flows
Kazuto Nakashima, Xiaowen Liu, Tomoya Miyawaki, Yumi Iwashita, Ryo Kurazume
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models
Khaled Abud, Sergey Lavrushkin, Alexey Kirillov, Dmitriy Vatolin
Hard Constraint Guided Flow Matching for Gradient-Free Generation of PDE Solutions
Chaoran Cheng, Boran Han, Danielle C. Maddix, Abdul Fatir Ansari, Andrew Stuart, Michael W. Mahoney, Yuyang Wang
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Zehuan Wu, Jingcheng Ni, Xiaodong Wang, Yuxin Guo, Rui Chen, Lewei Lu, Jifeng Dai, Yuwen Xiong
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Shufan Li, Konstantinos Kallidromitis, Akash Gokul, Zichun Liao, Yusuke Kato, Kazuki Kozuka, Aditya Grover
Graph Community Augmentation with GMM-based Modeling in Latent Space
Shintaro Fukushima, Kenji Yamanishi
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers
Chancharik Mitra, Brandon Huang, Tianning Chai, Zhiqiu Lin, Assaf Arbelle, Rogerio Feris, Leonid Karlinsky, Trevor Darrell, Deva Ramanan, Roei Herzig
SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
Rong-Cheng Tu, Wenhao Sun, Zhao Jin, Jingyi Liao, Jiaxing Huang, Dacheng Tao
Data Augmentation with Diffusion Models for Colon Polyp Localization on the Low Data Regime: How much real data is enough?
Adrian Tormos, Blanca Llauradó, Fernando Núñez, Axel Romero, Dario Garcia-Gasulla, Javier Béjar
Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification
José Fernando Núñez, Jamie Arjona, Javier Béjar
Learning the Evolution of Physical Structure of Galaxies via Diffusion Models
Andrew Lizarraga, Eric Hanchen Jiang, Jacob Nowack, Yun Qi Li, Ying Nian Wu, Bernie Boscoe, Tuan Do