Speech Model
Speech models aim to represent and process spoken language computationally, enabling applications like automatic speech recognition (ASR) and text-to-speech (TTS). Current research emphasizes improving model robustness (e.g., to noise and accents), fairness (mitigating biases against marginalized language varieties), and efficiency (through techniques like knowledge distillation and low-rank adaptation), often utilizing transformer-based architectures and self-supervised learning. These advancements have significant implications for various fields, including healthcare (e.g., voice disorder detection, mental health assessment), language preservation, and human-computer interaction.
Papers
December 6, 2023
November 22, 2023
November 21, 2023
November 17, 2023
November 16, 2023
October 29, 2023
October 26, 2023
September 25, 2023
September 18, 2023
September 14, 2023
August 30, 2023
August 17, 2023
August 14, 2023
July 11, 2023
June 30, 2023
June 14, 2023
June 13, 2023
June 8, 2023
May 30, 2023