Speech to Text
Speech-to-text (STT) research aims to accurately and efficiently convert spoken language into written text, encompassing tasks like automatic speech recognition and speech translation. Current efforts focus on improving model robustness and accuracy, particularly for low-resource languages and challenging audio conditions, often leveraging large language models (LLMs) and transformer-based architectures like Whisper and Conformer, alongside techniques like data augmentation and transfer learning. These advancements have significant implications for accessibility, enabling improved human-computer interaction and facilitating the development of more inclusive and versatile applications across various fields.
Papers
February 27, 2023
February 25, 2023
February 3, 2023
January 10, 2023
December 8, 2022
December 3, 2022
November 17, 2022
November 14, 2022
November 12, 2022
November 4, 2022
October 27, 2022
October 26, 2022
October 21, 2022
October 5, 2022
August 28, 2022
July 1, 2022
June 27, 2022
June 5, 2022
May 25, 2022
May 9, 2022