Speech Corpus
Speech corpora are collections of recorded speech data, crucial for training and evaluating automatic speech recognition (ASR) and text-to-speech (TTS) systems. Current research emphasizes creating diverse corpora representing various accents, languages (including low-resource and indigenous languages), speaking styles, and conditions (e.g., disordered speech), often employing self-supervised learning and transformer-based models like Wav2Vec 2.0 and Whisper for improved accuracy and efficiency. These advancements are vital for improving the accessibility and performance of speech technologies across diverse populations and applications, including healthcare, education, and assistive technologies.
Papers
February 28, 2023
February 26, 2023
December 11, 2022
November 29, 2022
November 23, 2022
November 22, 2022
November 3, 2022
November 1, 2022
October 27, 2022
October 26, 2022
October 15, 2022
August 25, 2022
July 18, 2022
July 17, 2022
July 12, 2022
July 2, 2022
July 1, 2022
June 26, 2022
June 19, 2022