Speech Codec
Speech codecs are algorithms that compress and decompress speech signals for efficient transmission and storage; the goal is to reduce bitrate while preserving audio quality and keeping latency low. Current research focuses on neural network-based codecs, which combine architectures such as transformers with vector quantization and often incorporate techniques such as disentangled representation learning and adaptive feature fusion to improve fidelity and efficiency, particularly at low bitrates. These advances matter for applications ranging from voice communication and text-to-speech synthesis to speech-based AI systems, where they promise better quality and lower resource use.
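To make the link between vector quantization and bitrate concrete, here is a minimal sketch in plain Python. It is not taken from any particular codec: the tiny 2-D codebook, the function names, and the 50 frames-per-second rate are all illustrative assumptions. In a neural codec the codebook would be learned and the frames would be encoder outputs rather than hand-picked numbers.

```python
import math


def nearest_index(frame, codebook):
    """Return the index of the codeword closest to `frame` (squared Euclidean distance)."""
    best_index, best_dist = 0, float("inf")
    for index, codeword in enumerate(codebook):
        dist = sum((a - b) ** 2 for a, b in zip(frame, codeword))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def encode(frames, codebook):
    """Compress: replace each feature frame by the index of its nearest codeword."""
    return [nearest_index(frame, codebook) for frame in frames]


def decode(indices, codebook):
    """Decompress: look each transmitted index up in the shared codebook."""
    return [list(codebook[i]) for i in indices]


def bitrate_bps(codebook_size, frames_per_second, num_codebooks=1):
    """Bits per second when every frame sends one index per codebook."""
    return num_codebooks * math.log2(codebook_size) * frames_per_second


# Hypothetical toy codebook with 4 entries (2 bits per index).
codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Hypothetical feature frames standing in for encoder outputs.
frames = [[0.1, -0.2], [0.9, 0.8], [0.2, 1.1]]

indices = encode(frames, codebook)
reconstructed = decode(indices, codebook)

print(indices)                 # [0, 3, 2]
print(reconstructed)           # [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
print(bitrate_bps(1024, 50))   # 500.0
```

Only the indices are transmitted, so the bitrate depends on the codebook size and the frame rate rather than on the dimensionality of the features. Under the assumed 50 frames per second, a single 1024-entry codebook costs 10 bits per frame, or 500 bit/s; stacking several codebooks (the `num_codebooks` parameter) raises the bitrate proportionally in exchange for a finer reconstruction.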
Papers
September 9, 2024
June 4, 2024
April 30, 2024
April 3, 2024
March 31, 2024
October 11, 2023
September 25, 2023
August 31, 2023
July 25, 2023
July 18, 2022
May 11, 2022