Generative Audio

Generative audio focuses on creating realistic and high-quality audio using artificial intelligence, aiming to improve audio restoration, compression, and synthesis. Current research emphasizes developing efficient and controllable models, employing architectures like autoencoders, GANs, transformers, diffusion models, and neural audio codecs, often incorporating techniques like latent space manipulation and consistency modeling to enhance audio quality and semantic control. This field is significant for its potential applications in music production, sound design, speech processing, and assistive technologies, while also raising important ethical considerations regarding copyright, authenticity, and potential misuse.

Papers