Speech Synthesis
Speech synthesis generates human-like speech from text or other inputs, and research in the field focuses on improving naturalness, expressiveness, and efficiency. Current work emphasizes model architectures such as diffusion models, generative adversarial networks (GANs), and large language models (LLMs), often combined with techniques such as low-rank adaptation (LoRA) for parameter efficiency and with finer control over attributes such as emotion and prosody. These advances matter for applications ranging from assistive technologies for the visually impaired to realistic virtual avatars and broader speech support for under-resourced languages.
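To make the parameter-efficiency point concrete, the sketch below illustrates the general idea behind low-rank adaptation: a frozen weight matrix W is left untouched and only a low-rank update B·A, scaled by alpha / r, is trained. It is a minimal illustration in plain Python, not the method of any particular paper listed here; the function names, the dimensions, and the `alpha` value are all assumptions chosen for the example.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def lora_forward(x, W, A, B, alpha):
    """Compute x W^T + (alpha / r) * x A^T B^T, where W stays frozen.

    Shapes: x is (n, d_in), W is (d_out, d_in), A is (r, d_in), B is (d_out, r).
    """
    r = len(A)
    scale = alpha / r
    base = matmul(x, transpose(W))                          # frozen path
    update = matmul(matmul(x, transpose(A)), transpose(B))  # low-rank path
    return [[b + scale * u for b, u in zip(b_row, u_row)]
            for b_row, u_row in zip(base, update)]

def trainable_params(d_in, d_out, r):
    """Number of values in A and B, the only matrices that are trained."""
    return r * (d_in + d_out)

# Hypothetical toy sizes, chosen only to keep the example fast.
random.seed(0)
d_in, d_out, r = 6, 4, 2
W = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
A = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]
B = [[0.0] * r for _ in range(d_out)]  # zero init: the adapter starts as a no-op
x = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(3)]

y = lora_forward(x, W, A, B, alpha=16.0)
```

Because B starts at zero, the adapted layer initially reproduces the frozen layer exactly. For a 512 × 512 layer with r = 8, `trainable_params(512, 512, 8)` gives 8,192 trainable values, compared with 262,144 in the full matrix.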