Multilingual Tt

Multilingual text-to-speech (TTS) research aims to create systems capable of generating natural-sounding speech in multiple languages, often addressing challenges posed by limited data availability for less-resourced languages. Current efforts focus on developing robust model architectures that leverage techniques like self-supervised learning, disentangled representations, and translation-enhanced approaches to improve cross-lingual transfer and maintain speaker identity and accent. These advancements are significant because they promise to broaden access to speech technology across diverse linguistic communities and facilitate applications such as multilingual dubbing and cross-lingual communication aids.

Papers

July 19, 2024

Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings
Praveen Srinivasa Varadhan, Ashwin Sankar, Giri Raju, Mitesh M. Khapra
Low Resource Indian Language Expressive Speech Expressive Speech Synthesis Multilingual Tt

May 30, 2023

Translation-Enhanced Multilingual Text-to-Image Generation
Yaoyiran Li, Ching-Yun Chang, Stephen Rawls, Ivan Vulić, Anna Korhonen
Natural Language Processing Machine Translation Text to Image Generation Multilingual Tt

March 1, 2023

ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations
Neil Shah, Saiteja Kosgi, Vishal Tambrahalli, Neha Sahipjohn, Niranjan Pedanekar, Vineet Gandhi
Indian Language Self Supervised Speech Representation Multilingual Scenario Text to Speech Synthesis Multilingual Tt

January 24, 2023

Multilingual Multiaccented Multispeaker TTS with RADTTS
Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro
Speech Synthesis Accented Speech Multilingual Tt

October 27, 2022

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech
Takaaki Saeki, Heiga Zen, Zhehuai Chen, Nobuyuki Morioka, Gary Wang, Yu Zhang, Ankur Bapna, Andrew Rosenberg, Bhuvana Ramabhadran
Speech to Text Self Supervised Speech Representation High Quality Speech Speech Text Multilingual Tt

June 24, 2022

SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
Hyunjae Cho, Wonbin Jung, Junhyeok Lee, Sang Hoon Woo
End to End Text to Speech Speech Synthesis Cross Lingual Natural Language Inference Multilingual Tt

May 13, 2022

Talking Face Generation with Multilingual TTS
Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim
Synthesized Speech Face Generation Multilingual Speech Talking Face Video Multilingual Tt