Expressive Speech
Expressive speech synthesis aims to generate speech that conveys not only linguistic content but also emotional nuances and stylistic variations, mirroring the richness of human communication. Current research focuses on improving the expressiveness of models, often employing techniques like diffusion models, variational autoencoders, and graph neural networks, and incorporating linguistic features (e.g., emphasis, semantics) to enhance control and naturalness. Advances in this field have significant implications for applications such as virtual assistants, audiobooks, and accessibility technologies, while also providing valuable insights into the computational modeling of human communication.
Papers
April 16, 2024
March 25, 2024
March 14, 2024
March 12, 2024
February 23, 2024
February 18, 2024
January 1, 2024
December 23, 2023
December 15, 2023
November 2, 2023
October 26, 2023
September 22, 2023
September 3, 2023
August 31, 2023
August 25, 2023
August 22, 2023
July 29, 2023
July 3, 2023
June 9, 2023