Dysarthric Speech Reconstruction

Dysarthric speech reconstruction (DSR) aims to convert speech that is unintelligible or distorted because of a neurological disorder into clear, natural-sounding speech. Current research often takes multi-modal approaches that combine audio with visual information such as lip movements. Common architectures include neural codec language models and encoder-decoder frameworks, frequently paired with adversarial training and self-supervised learning to improve speaker similarity and prosody. These approaches aim to improve speech intelligibility and naturalness, with the potential to enhance communication and quality of life for individuals with dysarthria.

Papers

June 12, 2024

CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction
Xueyuan Chen, Dongchao Yang, Dingdong Wang, Xixin Wu, Zhiyong Wu, Helen Meng
Speech Encoder Dysarthric Speech Codec Language Model Phonetic Embeddings Dysarthric Speech Reconstruction

January 31, 2024

Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction
Xueyuan Chen, Yuejiao Wang, Xixin Wu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng
Audio Visual Speech Intelligibility Dysarthric Speech Phonological Reconstruction Dysarthric Speech Reconstruction

January 26, 2024

UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization
Yuejiao Wang, Xixin Wu, Disong Wang, Lingwei Meng, Helen Meng
Generative Adversarial Dysarthric Speech Phonological Reconstruction Speaker Normalization Dysarthric Speech Reconstruction

January 8, 2024

Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss
Yusheng Tian, Jingyu Li, Tan Lee
Synthesized Speech Generated Text Synthetic Voice Reconstruction Loss Personalized Speech Speech Sound Disorder Dysarthric Speech Reconstruction

February 18, 2022

Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation
Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng
Dysarthric Speech Adversarial Adaptation Dysarthric Speech Reconstruction
