Expressive Speech
Expressive speech synthesis aims to generate speech that conveys not only linguistic content but also emotional nuance and stylistic variation, mirroring the richness of human communication. Current research focuses on making synthesized speech more expressive, often using diffusion models, variational autoencoders, and graph neural networks, and on incorporating linguistic features (e.g., emphasis, semantics) to improve control and naturalness. Advances in this field matter for applications such as virtual assistants, audiobooks, and accessibility technologies, and they also offer insight into the computational modeling of human communication.