Speech Model
Speech models aim to represent and process spoken language computationally, enabling applications like automatic speech recognition (ASR) and text-to-speech (TTS). Current research emphasizes improving model robustness (e.g., to noise and accents), fairness (mitigating biases against marginalized language varieties), and efficiency (through techniques like knowledge distillation and low-rank adaptation), often utilizing transformer-based architectures and self-supervised learning. These advancements have significant implications for various fields, including healthcare (e.g., voice disorder detection, mental health assessment), language preservation, and human-computer interaction.
Papers
May 23, 2023
May 18, 2023
May 16, 2023
May 14, 2023
April 27, 2023
March 30, 2023
March 8, 2023
March 3, 2023
February 27, 2023
February 24, 2023
February 23, 2023
December 8, 2022
December 2, 2022
November 30, 2022
November 29, 2022
November 10, 2022
October 14, 2022
October 3, 2022
September 26, 2022
August 24, 2022