Speech Transcription
Speech transcription, the automated conversion of spoken language into text, aims to create accurate and efficient systems for diverse applications. Current research focuses on improving the speed and accuracy of transformer-based models like Whisper, addressing challenges posed by noisy or diverse audio data, and exploring end-to-end approaches that integrate speech recognition with other tasks such as summarization, translation, and emotion recognition. These advancements have significant implications for accessibility (e.g., subtitling, transcription of legal proceedings), healthcare (e.g., Alzheimer's diagnosis), and language learning, particularly in low-resource settings where large labeled datasets are scarce.
Papers
September 24, 2024
September 11, 2024
July 30, 2024
June 18, 2024
June 14, 2024
June 9, 2024
April 30, 2024
March 27, 2024
November 3, 2023
October 26, 2023
September 24, 2023
August 5, 2023
July 11, 2023
June 22, 2023
June 6, 2023
January 18, 2023
November 29, 2022
November 18, 2022
November 17, 2022
November 16, 2022