Non Autoregressive ASR

Non-autoregressive (NAR) automatic speech recognition (ASR) aims to speed up speech-to-text conversion by predicting all output tokens in parallel, whereas traditional autoregressive models emit one token at a time, each conditioned on the tokens before it. Because parallel prediction tends to cost accuracy, current research focuses on closing that gap by incorporating lexical information, leveraging pre-trained models, and developing architectures such as self-conditioned folded encoders and contextual Paraformers, which target weaknesses in recognizing rare words and customizing hotwords. These advances are aimed at faster, more efficient speech processing in applications such as real-time transcription and personalized voice assistants.
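
To make the parallel-prediction idea concrete, here is a minimal sketch of greedy CTC decoding, one common way to decode non-autoregressively. It is an illustration only and is not taken from any of the papers listed here. The function name `ctc_greedy_decode`, the toy vocabulary, the per-frame scores, and the choice of index 0 for the blank symbol are all assumptions made for the example.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank symbol


def ctc_greedy_decode(frame_scores, vocab):
    """Decode per-frame scores into tokens without any left-to-right dependency.

    frame_scores: list of per-frame score lists (one score per vocab entry).
    vocab: list mapping label index -> token string.
    """
    # Each frame is labelled independently of every other frame, so this
    # step could run fully in parallel; no earlier output is consulted.
    best = [max(range(len(scores)), key=scores.__getitem__)
            for scores in frame_scores]
    # Standard CTC collapse: merge consecutive repeats, then drop blanks.
    collapsed = [label for label, _ in groupby(best)]
    return [vocab[i] for i in collapsed if i != BLANK]


# Toy input (hypothetical values): five frames over a three-symbol vocabulary.
vocab = ["<blank>", "h", "i"]
frame_scores = [
    [0.10, 0.80, 0.10],
    [0.10, 0.70, 0.20],
    [0.90, 0.05, 0.05],
    [0.20, 0.10, 0.70],
    [0.10, 0.10, 0.80],
]
print("".join(ctc_greedy_decode(frame_scores, vocab)))  # prints "hi"
```

An autoregressive decoder would instead run a loop in which each step takes the previously emitted tokens as input, so its latency grows with output length. The independence assumption that makes the sketch above fast is also why NAR models can struggle with rare words, which is the gap the work listed below tries to close.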

Papers

September 24, 2024

Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices
Leonid Velikovich, Christopher Li, Diamantino Caseiro, Shankar Kumar, Pat Rondon, Kandarp Joshi, Xavier Velez
End to End Automatic Speech Recognition Model Spelling Correction Grapheme to Phoneme Non Autoregressive ASR

August 7, 2023

SeACo-Paraformer: A Non-Autoregressive ASR System with Flexible and Effective Hotword Customization Ability
Xian Shi, Yexin Yang, Zerui Li, Yanni Chen, Zhifu Gao, Shiliang Zhang
Extension Study ASR System Contextual Asr Non Autoregressive ASR

May 18, 2023

A Lexical-aware Non-autoregressive Transformer-based ASR Model
Chong-En Lin, Kuan-Yu Chen
Lexical Aware Non Autoregressive Automatic Speech Recognition Non Autoregressive ASR

January 29, 2023

Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model
Xian Shi, Yanni Chen, Shiliang Zhang, Zhijie Yan
Automatic Speech Recognition ASR System Timestamp Annotation Non Autoregressive End to End Timestamp Supervision Non Autoregressive ASR

February 17, 2022

Non-Autoregressive ASR with Self-Conditioned Folded Encoders
Tatsuya Komatsu
Encoder Side State of the Art Encoders Self Restraint Non Autoregressive ASR
