Speech Model
Speech models aim to represent and process spoken language computationally, enabling applications like automatic speech recognition (ASR) and text-to-speech (TTS). Current research emphasizes improving model robustness (e.g., to noise and accents), fairness (mitigating biases against marginalized language varieties), and efficiency (through techniques like knowledge distillation and low-rank adaptation), often utilizing transformer-based architectures and self-supervised learning. These advancements have significant implications for various fields, including healthcare (e.g., voice disorder detection, mental health assessment), language preservation, and human-computer interaction.
Papers
October 26, 2024
September 21, 2024
September 14, 2024
September 12, 2024
September 9, 2024
September 4, 2024
September 3, 2024
August 29, 2024
August 26, 2024
August 20, 2024
July 23, 2024
July 15, 2024
July 3, 2024
June 29, 2024
June 25, 2024
June 20, 2024
June 16, 2024
June 13, 2024
June 12, 2024