Codec Model
Codec models are neural networks designed to compress and decompress audio data, serving as crucial components in speech and audio processing applications. Current research emphasizes improving the efficiency and fidelity of these models, focusing on architectures that balance compression ratios with the preservation of crucial audio features like emotion and speaker characteristics, often employing vector quantization techniques. This active area of research aims to create codecs that are robust across diverse audio types and bitrates, enabling advancements in speech recognition, audio generation, and real-time communication systems.
Papers
October 14, 2024
September 24, 2024
September 21, 2024
July 31, 2024
July 22, 2024
April 7, 2024
February 20, 2024
February 19, 2024
September 14, 2023
May 4, 2023
November 12, 2022
November 8, 2022