Automatic Speech Recognition Model
Automatic speech recognition (ASR) models aim to accurately convert spoken language into text, a crucial task with broad applications. Current research emphasizes improving ASR performance in challenging scenarios, such as low-resource languages, accented speech, and noisy environments, often leveraging large language models (LLMs) and techniques like parameter-efficient fine-tuning and self-supervised learning. These advancements are driven by the need for more robust, accurate, and equitable ASR systems across diverse languages and speaker demographics, impacting fields ranging from healthcare to legal proceedings.
Papers
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models
Hsuan Su, Ting-Yao Hu, Hema Swetha Koppula, Raviteja Vemulapalli, Jen-Hao Rick Chang, Karren Yang, Gautam Varma Mantena, Oncel Tuzel
Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
George August Wright, Umberto Cappellazzo, Salah Zaiem, Desh Raj, Lucas Ondel Yang, Daniele Falavigna, Mohamed Nabih Ali, Alessio Brutti
SlothSpeech: Denial-of-service Attack Against Speech Recognition Models
Mirazul Haque, Rutvij Shah, Simin Chen, Berrak Şişman, Cong Liu, Wei Yang
Adaptation and Optimization of Automatic Speech Recognition (ASR) for the Maritime Domain in the Field of VHF Communication
Emin Cagatay Nakilcioglu, Maximilian Reimann, Ole John