Multilingual Speech Recognition

Multilingual speech recognition (MSR) aims to build systems capable of accurately transcribing speech from multiple languages, addressing the limitations of monolingual systems. Current research focuses on improving the efficiency and accuracy of multilingual models, exploring architectures like transformers, mixture-of-experts, and parameter-efficient fine-tuning methods to handle both code-switching and low-resource languages. These advancements are crucial for bridging language barriers in applications such as multilingual assistants, international communication, and accessibility technologies, impacting both the development of robust AI models and the expansion of global communication.

Papers