Multilingual Automatic Speech Recognition Model
Multilingual automatic speech recognition (ASR) aims to build models capable of accurately transcribing speech across multiple languages, addressing the challenge of limited resources for many languages. Current research focuses on improving model configurability and robustness, often employing techniques like weighted cross-entropy for low-resource languages, knowledge distillation for efficiency, and adaptive masking for model compression. These advancements are crucial for broadening access to speech technology globally and improving the accuracy and efficiency of multilingual human-computer interaction.
Papers
October 6, 2024
September 25, 2024
September 4, 2024
June 26, 2024
June 6, 2024
February 28, 2024
December 26, 2023
September 22, 2023
July 24, 2023
June 1, 2023
May 31, 2023
May 30, 2023
April 29, 2023
November 10, 2022
October 30, 2022
October 18, 2022
September 13, 2022
June 25, 2022
May 14, 2022