Direct Speech to Speech Translation
Direct speech-to-speech translation (S2ST) aims to translate spoken language from one language to another without intermediate text, offering faster and more natural-sounding translations than cascaded approaches. Current research focuses on improving model efficiency and accuracy through techniques like non-autoregressive architectures, pre-training with diverse data (including monolingual and audio-visual data), and the use of discrete speech units. These advancements are significant for bridging language barriers, particularly in low-resource settings, and have implications for applications such as real-time interpretation, subtitling, and voice-assisted technologies.
Papers
October 30, 2024
October 28, 2024
September 26, 2024
September 13, 2024
June 11, 2024
February 25, 2024
December 23, 2023
October 24, 2023
October 11, 2023
September 14, 2023
June 20, 2023
May 28, 2023
May 27, 2023
May 24, 2023
April 10, 2023
December 15, 2022
December 12, 2022
October 31, 2022