Expressive Speech
Expressive speech synthesis aims to generate speech that conveys not only linguistic content but also emotional nuance and stylistic variation, mirroring the richness of human communication. Current research focuses on making synthesized speech more expressive, often using diffusion models, variational autoencoders, and graph neural networks, and incorporating linguistic features (e.g., emphasis, semantics) to improve control and naturalness. Advances in this field matter for applications such as virtual assistants, audiobooks, and accessibility technologies, and they also offer insight into the computational modeling of human communication.