Fast Generation
Fast generation aims to accelerate the computationally expensive process of generating data, particularly in large language models and diffusion models used for image and 3D object creation. Current research focuses on optimizing model architectures (e.g., convolutional networks, diffusion models) through techniques like neural architecture search, model compression, and novel sampling strategies to reduce computational burden without sacrificing output quality. These advancements are crucial for enabling real-time applications of generative AI, improving efficiency in scientific simulations (e.g., particle physics), and expanding the accessibility of powerful generative models to users with limited computational resources.
Papers
November 12, 2024
October 16, 2024
October 10, 2024
October 2, 2024
September 26, 2024
July 29, 2024
May 31, 2024
May 15, 2024
April 8, 2024
October 12, 2023
May 30, 2023
May 24, 2023
March 23, 2023
March 13, 2023
November 23, 2022