Speech Processing

Speech processing research aims to enable computers to understand, interpret, and generate human speech, focusing on tasks like speech recognition, synthesis, and enhancement. Current efforts concentrate on improving model efficiency (e.g., using linear-complexity attention mechanisms) and robustness across diverse languages and acoustic conditions, often leveraging large language models and self-supervised learning techniques. These advancements are crucial for broader accessibility of speech technology, impacting fields ranging from healthcare (e.g., depression screening) to assistive technologies and improving human-computer interaction.

Papers

December 10, 2021

DEBACER: a method for slicing moderated debates
Thomas Palmeira Ferraz, Alexandre Alcoforado, Enzo Bustos, André Seidel Oliveira, Rodrigo Gerber, Naíde Müller, André Corrêa d'Almeida, Bruno Miguel Veloso, Anna Helena Reali Costa
Practical Method Speech Processing Speech Separation Dialogue Segmentation

November 29, 2021

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W Black, Shinji Watanabe
Spoken Language Understanding Speech Processing NLU Model Speech Processing Task ESPnet ST

November 19, 2021

SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech
Suwon Shon, Ankita Pasad, Felix Wu, Pablo Brusco, Yoav Artzi, Karen Livescu, Kyu J. Han
Automatic Speech Recognition Spoken Language Understanding Speech Processing Natural Sounding Speech Benchmark Task

Speech Processing

Papers

DEBACER: a method for slicing moderated debates

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech

Biologically inspired speech emotion recognition

Analysis of Data Augmentation Methods for Low-Resource Maltese ASR

Comparative Study of Speech Analysis Methods to Predict Parkinson's Disease