Recurrent Neural Network Transducer
Recurrent Neural Network Transducers (RNN-Ts) are a prominent architecture for end-to-end automatic speech recognition (ASR), aiming to improve accuracy and efficiency in streaming applications. Current research focuses on optimizing RNN-T models for various constraints, including low latency, reduced memory footprint (through techniques like binarization and knowledge distillation), and robustness to noisy data or diverse acoustic conditions. These advancements are significant for deploying accurate and efficient ASR systems on resource-limited devices and improving the performance of various speech-related applications, such as voice assistants and conversational AI.
Papers
July 14, 2024
June 5, 2024
December 15, 2023
September 26, 2023
September 5, 2023
July 17, 2023
June 21, 2023
May 12, 2023
March 10, 2023
February 28, 2023
December 29, 2022
December 21, 2022
December 16, 2022
October 29, 2022
October 17, 2022
October 14, 2022
September 29, 2022
September 13, 2022
July 28, 2022