Conformer Transducer

Conformer Transducer models are a leading architecture in automatic speech recognition (ASR), aiming to improve accuracy and efficiency, particularly in challenging scenarios like child speech or streaming applications. Current research focuses on enhancing these models through techniques like incorporating contextual information (e.g., from previous turns in a conversation or language models), addressing limitations in streaming performance via improved normalization methods, and developing model compression strategies for resource-constrained devices. These advancements hold significant promise for improving the accuracy and accessibility of ASR systems across diverse applications and hardware platforms.

Papers

February 9, 2024

Self-consistent context aware conformer transducer for speech recognition
Konstantin Kolokolov, Pavel Pekichev, Karthik Raghunathan
Speech Recognition Automatic Speech Recognition System Self Consistency Contextual Language Model Rare Word Conformer Transducer

January 14, 2024

Promptformer: Prompted Conformer Transducer for ASR
Sergio Duarte-Torres, Arunasish Sen, Aman Rana, Lukas Drude, Alejandro Gomez-Alanis, Andreas Schwarz, Leif Rädel, Volker Leutnant
Automatic Speech Recognition Bidirectional Encoder Representation From Transformer Acoustic Representation Contextual Cue Textual Context Conformer Transducer

November 7, 2023

A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition
Andrei Barcovschi, Rishabh Jain, Peter Corcoran
Automatic Speech Recognition Comparative Study Automatic Speech Recognition Performance State of the Art Whisper Wav2vec U Child Speech Child Speech Recognition Conformer Transducer

July 20, 2023

Globally Normalising the Transducer for Streaming Speech Recognition
Rogier van Dalen
Speech Recognition Input Sequence RNN Transducer Transformer Transducer Conformer Transducer

October 1, 2022

Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Jash Rathod, Nauman Dawalatabad, Shatrughan Singh, Dhananjaya Gowda
Knowledge Distillation Librispeech Speech Recognition Sequence Transducer Conformer Transducer

Conformer Transducer

Papers

Self-consistent context aware conformer transducer for speech recognition

Promptformer: Prompted Conformer Transducer for ASR

A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition

Globally Normalising the Transducer for Streaming Speech Recognition

Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition