Phoneme Representation
Phoneme representation research focuses on developing effective computational models of speech sounds for applications like speech recognition, synthesis, and pronunciation assessment. Current efforts concentrate on improving the robustness and efficiency of these representations, often employing techniques like adversarial training, multilingual pre-trained models (e.g., BERT-based architectures), and the integration of non-verbal cues. These advancements are crucial for enhancing the accuracy and generalizability of speech technologies across diverse languages and speaking styles, impacting fields ranging from human-computer interaction to language learning.
Papers
June 14, 2024
September 14, 2023
August 12, 2023
July 24, 2023
May 31, 2023
September 13, 2022
March 31, 2022