Prosodic Feature
Prosodic features, encompassing aspects of speech like pitch, intensity, and rhythm, are crucial for conveying meaning and emotion beyond the literal words spoken. Current research focuses on accurately modeling and manipulating these features in applications such as speech synthesis, editing, and voice conversion, often employing deep learning models like diffusion models, variational autoencoders, and actor-critic reinforcement learning. This work is significant for improving the naturalness and expressiveness of synthetic speech, enhancing accessibility for individuals with communication disorders, and advancing our understanding of human communication itself.
Papers
May 18, 2024
May 15, 2024
May 2, 2024
April 27, 2024
April 26, 2024
April 16, 2024
March 21, 2024
March 13, 2024
March 6, 2024
March 3, 2024
February 22, 2024
February 20, 2024
February 1, 2024
December 21, 2023
December 16, 2023
November 28, 2023
November 13, 2023
October 23, 2023