Speech Component
Speech component research focuses on understanding and manipulating the constituent elements of speech, such as content, pitch, rhythm, and timbre, for applications like speech recognition, voice conversion, and disorder diagnosis. Current research employs deep learning models, including attention-based encoder-decoder architectures and generative adversarial networks, to disentangle these components, often leveraging self-supervised learning and mutual information estimation techniques. These advancements are improving the accuracy and efficiency of speech processing technologies and providing valuable insights into human speech production and perception, with implications for both clinical applications and human-computer interaction.
Papers
July 25, 2024
July 4, 2024
April 30, 2024
March 28, 2024
October 2, 2023
July 6, 2023
August 18, 2022
May 24, 2022