Speech Translation
Speech translation (ST) aims to automatically convert spoken language in one language into written or spoken text in another, bridging communication barriers. Current research heavily utilizes large language models (LLMs) integrated with speech foundation models (SFMs), often employing techniques like chain-of-thought prompting and multimodal approaches to improve accuracy and reduce latency, particularly in simultaneous ST. These advancements are significant for improving cross-lingual communication in various applications, from real-time interpretation to accessibility tools, and are driving innovation in both model architectures and evaluation methodologies.
Papers
December 18, 2023
December 8, 2023
December 2, 2023
November 12, 2023
November 1, 2023
October 31, 2023
October 24, 2023
October 23, 2023
October 20, 2023
October 19, 2023
October 13, 2023
October 3, 2023
September 30, 2023
September 28, 2023
September 27, 2023
September 26, 2023
September 21, 2023
September 20, 2023