Bilingual Automatic Speech Recognition
Bilingual automatic speech recognition (ASR) aims to build a single system that accurately transcribes speech in two languages, handling both code-switched utterances (where speakers mix languages within an utterance) and monolingual segments in either language. Current research focuses on model architectures such as neural transducers, and on techniques such as attention mechanisms and byte-level subword representations, to improve accuracy and efficiency, particularly in low-resource scenarios. These advances matter for human-computer interaction in multilingual settings, with applications ranging from language learning tools to real-time translation services.
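To make the idea behind byte-level representations concrete, the sketch below encodes a code-switched utterance as UTF-8 bytes, so that both languages share one vocabulary of at most 256 symbols. It is a minimal illustration only: the function names and the sample utterance are hypothetical, and it shows just the byte-level base units, not the subword merging (for example, byte-pair encoding learned over bytes) that a real system would typically apply on top.

```python
def to_byte_tokens(text: str) -> list[int]:
    """Map text in any language to a sequence of UTF-8 byte values (0-255)."""
    return list(text.encode("utf-8"))


def from_byte_tokens(tokens: list[int]) -> str:
    """Map byte values back to text.

    A model can emit byte sequences that are not valid UTF-8, so invalid
    bytes are replaced rather than raising an error (an assumption of
    this sketch, not a prescribed decoding strategy).
    """
    return bytes(tokens).decode("utf-8", errors="replace")


# Hypothetical Mandarin-English code-switched utterance.
utterance = "我想要 a cup of coffee"
tokens = to_byte_tokens(utterance)

# Every token fits in the same 256-symbol vocabulary, whatever the script.
vocab_ok = all(0 <= t < 256 for t in tokens)

# The encoding is lossless for valid text.
round_trip = from_byte_tokens(tokens)
```

The trade-off is sequence length: each ASCII character takes one byte, while each Chinese character here takes three, which is one reason byte-level units are usually combined with learned subword merges rather than used on their own.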