Speech to Unit
Speech-to-unit (S2U) translation focuses on converting speech into discrete, quantized representations ("units") for improved efficiency and performance in speech processing tasks. Current research emphasizes self-supervised learning and transformer-based architectures to generate these units, often within a direct speech-to-speech translation (S2ST) framework, bypassing the need for intermediate text transcription. This approach is particularly relevant for low-resource languages and applications requiring real-time translation, offering potential improvements in speed, accuracy, and multilingual capabilities for various applications like voice assistants and cross-lingual communication.
Papers
July 8, 2024
June 4, 2024
May 30, 2024
February 25, 2024
December 23, 2023
August 3, 2023
July 17, 2023
June 28, 2023
May 15, 2023
December 15, 2022
November 3, 2022
October 7, 2022
April 6, 2022