Transformer Based Automatic Speech Recognition
Transformer-based automatic speech recognition (ASR) aims to improve the accuracy and efficiency of converting spoken language into text using the transformer neural network architecture. Current research focuses on optimizing model architectures like Conformers and Transformer Transducers, incorporating contextual information for improved rare word recognition, and developing efficient decoding strategies to reduce latency and energy consumption for on-device applications. These advancements are significant because they promise more accurate, faster, and resource-efficient speech recognition systems, impacting fields ranging from virtual assistants to accessibility technologies.
Papers
September 24, 2024
July 4, 2024
June 14, 2024
June 11, 2024
May 2, 2024
September 7, 2023
June 12, 2023
June 7, 2023
May 12, 2023
April 24, 2023
March 20, 2023
February 16, 2023
November 11, 2022
November 2, 2022
September 30, 2022
July 2, 2022
June 15, 2022
March 29, 2022
March 11, 2022