Speech Resynthesis
Speech resynthesis manipulates and regenerates speech audio to improve quality, modify speaker characteristics, or convert emotional content while preserving the linguistic information. Current research emphasizes efficient generative architectures such as diffusion and flow-based models, often incorporating self-supervised learning and techniques such as parameter-efficient fine-tuning to mitigate catastrophic forgetting and speed up training. These advances are improving applications such as voice conversion, speech enhancement, and multilingual speech processing, with impact in fields ranging from media production to accessibility technologies.
Papers
August 5, 2024
June 20, 2024
February 19, 2024
February 16, 2024
December 22, 2023
December 21, 2023
September 29, 2023
August 22, 2023
August 10, 2023
June 29, 2023
June 2, 2023
June 1, 2023
December 21, 2022
September 21, 2022
February 15, 2022