Expressive Speech
Expressive speech synthesis aims to generate speech that conveys not only linguistic content but also emotional nuances and stylistic variations, mirroring the richness of human communication. Current research focuses on improving the expressiveness of models, often employing techniques like diffusion models, variational autoencoders, and graph neural networks, and incorporating linguistic features (e.g., emphasis, semantics) to enhance control and naturalness. Advances in this field have significant implications for applications such as virtual assistants, audiobooks, and accessibility technologies, while also providing valuable insights into the computational modeling of human communication.
Papers
November 23, 2024
November 19, 2024
October 24, 2024
October 2, 2024
October 1, 2024
September 24, 2024
August 12, 2024
August 8, 2024
July 19, 2024
July 6, 2024
July 1, 2024
June 27, 2024
June 13, 2024
June 12, 2024
June 6, 2024
May 29, 2024
May 27, 2024
May 20, 2024
May 19, 2024