Speech to Text
Speech-to-text (STT) research aims to accurately and efficiently convert spoken language into written text, encompassing tasks like automatic speech recognition and speech translation. Current efforts focus on improving model robustness and accuracy, particularly for low-resource languages and challenging audio conditions, often leveraging large language models (LLMs) and transformer-based architectures like Whisper and Conformer, alongside techniques like data augmentation and transfer learning. These advancements have significant implications for accessibility, enabling improved human-computer interaction and facilitating the development of more inclusive and versatile applications across various fields.
Papers
June 19, 2024
June 13, 2024
June 11, 2024
June 10, 2024
June 7, 2024
June 6, 2024
June 3, 2024
May 16, 2024
May 13, 2024
April 10, 2024
February 8, 2024
February 2, 2024
January 22, 2024
January 18, 2024
December 28, 2023
December 2, 2023
October 22, 2023
September 27, 2023