Speech Datasets
Speech datasets are crucial for training and evaluating automatic speech recognition (ASR) and text-to-speech (TTS) systems, as well as other speech processing applications like speech emotion recognition. Current research focuses on creating larger, more diverse datasets encompassing various languages, accents, speaking styles (including those with speech impediments), and recording conditions, alongside developing methods to improve data efficiency (e.g., data pruning, self-training) and address biases. These advancements are vital for improving the accuracy and robustness of speech technologies, leading to broader accessibility and applicability across diverse populations and contexts.
Papers
August 24, 2023
August 14, 2023
August 12, 2023
July 20, 2023
June 16, 2023
May 16, 2023
May 15, 2023
February 12, 2023
October 29, 2022
October 24, 2022
June 27, 2022
June 20, 2022
May 20, 2022
March 31, 2022
February 19, 2022