TTS System

Text-to-speech (TTS) systems aim to synthesize natural-sounding human speech from written text. Current research focuses on improving the quality and efficiency of TTS, particularly for longer passages, by incorporating contextual information across sentences and employing techniques like memory-cached recurrence and linearized self-attention within models such as VITS. This work is driven by the need for more expressive and computationally efficient TTS, with applications ranging from improved accessibility tools to advancements in speech synthesis for low-resource languages, as exemplified by efforts to expand corpora for languages like Kazakh.

Papers

July 3, 2023

ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee
Speech Analysis, Speech Generation, Expressive Speech, Cross Utterance, TTS System

January 27, 2022

The MSXF TTS System for ICASSP 2022 ADD Challenge
Chunyong Yang, Pengfei Liu, Yanli Chen, Hongbin Wang, Min Liu
TTS System

January 15, 2022

KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics
Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol
Text to Speech, Speaker Information, Significant Topic, TTS System
