Speech Generation
Speech generation research aims to build systems that produce natural-sounding, expressive speech from inputs such as text or other audio. Current work focuses on improving model efficiency and controllability. It explores autoregressive and non-autoregressive architectures, flow matching, and diffusion models, often built on discrete speech units and combined with techniques such as prompting and knowledge distillation. These advances matter for applications ranging from virtual assistants and accessibility tools to creative content generation and voice privacy technologies, and they drive progress in both speech processing and artificial intelligence.