Speech Codec

Speech codecs are algorithms that compress and decompress speech signals for efficient transmission and storage, aiming to minimize bitrate while preserving audio quality and minimizing latency. Current research focuses on developing neural network-based codecs, employing architectures like transformers and vector quantization, often incorporating techniques such as disentangled representation learning and adaptive feature fusion to improve fidelity and efficiency, particularly at low bitrates. These advancements are significant for applications ranging from voice communication and text-to-speech synthesis to speech-based AI systems, promising improvements in both quality and resource utilization.

Papers