Large Scale Speech
Large-scale speech research focuses on developing and improving systems that process and understand vast amounts of spoken language data. Current efforts concentrate on leveraging large language models (LLMs) and self-supervised learning techniques to enhance speech recognition, synthesis, and speaker verification, often employing transformer-based architectures and convolutional neural networks for feature extraction and classification. This work is crucial for advancing applications like voice assistants, multilingual communication tools, and digital health technologies that rely on accurate and efficient speech processing, while also addressing challenges like data bias and robustness.
Papers
October 30, 2024
July 14, 2024
June 25, 2024
June 2, 2024
September 21, 2023
September 19, 2023
September 7, 2023
August 30, 2023
August 22, 2023
July 20, 2023
June 14, 2023
May 18, 2023
October 26, 2022