Speech Generation
Speech generation research aims to create systems that produce natural-sounding and expressive speech from various inputs, such as text or other audio. Current efforts focus on improving model efficiency and controllability, exploring architectures like autoregressive and non-autoregressive models, flow matching, and diffusion models, often incorporating discrete speech units and leveraging techniques like prompting and knowledge distillation. These advancements are significant for applications ranging from virtual assistants and accessibility tools to creative content generation and voice privacy technologies, driving progress in both speech processing and artificial intelligence.
Papers
June 10, 2024
June 8, 2024
June 5, 2024
June 4, 2024
May 21, 2024
April 22, 2024
April 13, 2024
April 10, 2024
April 8, 2024
March 5, 2024
March 4, 2024
February 27, 2024
February 19, 2024
February 14, 2024
February 9, 2024
February 2, 2024
January 31, 2024
January 30, 2024
January 25, 2024