Speech Generation
Speech generation research aims to create systems that produce natural-sounding and expressive speech from various inputs, such as text or other audio. Current efforts focus on improving model efficiency and controllability, exploring architectures like autoregressive and non-autoregressive models, flow matching, and diffusion models, often incorporating discrete speech units and leveraging techniques like prompting and knowledge distillation. These advancements are significant for applications ranging from virtual assistants and accessibility tools to creative content generation and voice privacy technologies, driving progress in both speech processing and artificial intelligence.
Papers
January 22, 2024
January 5, 2024
January 3, 2024
December 25, 2023
December 15, 2023
November 24, 2023
November 13, 2023
November 6, 2023
October 30, 2023
October 26, 2023
October 23, 2023
October 2, 2023
September 14, 2023
September 8, 2023
September 5, 2023
August 14, 2023
July 23, 2023
July 7, 2023
July 3, 2023