Simultaneous Speech Translation
Simultaneous speech translation (SST) aims to generate real-time translations of spoken language, posing significant challenges in balancing translation quality with low latency. Current research focuses on improving the efficiency and accuracy of end-to-end models, often employing transformer architectures with techniques like blockwise processing, adaptive decision policies (e.g., integrate-and-fire mechanisms), and novel training strategies to mitigate gradient conflicts and optimize the quality-latency trade-off. These advancements are crucial for enhancing human-computer interaction and cross-lingual communication in various applications, such as real-time subtitling, interpreting services, and multilingual meetings.
Papers
October 21, 2024
September 24, 2024
August 18, 2024
August 14, 2024
July 31, 2024
July 30, 2024
June 20, 2024
June 14, 2024
June 11, 2024
June 6, 2024
June 1, 2024
October 27, 2023
October 17, 2023
October 6, 2023
September 20, 2023
July 3, 2023
June 14, 2023