Level Pronunciation

Level pronunciation research focuses on automatically assessing the accuracy of spoken language, providing detailed feedback at various granularities (phoneme, word, utterance) and across multiple aspects (accuracy, fluency, completeness). Current research emphasizes the development of deep learning models, often employing transformer architectures and attention mechanisms, to analyze acoustic features and compare them against reference pronunciations, sometimes leveraging phone embeddings and multi-source information. These advancements aim to improve the accuracy and efficiency of computer-assisted pronunciation training systems, ultimately benefiting language learners and researchers alike by providing more effective and nuanced feedback on pronunciation.

Papers