Neural Speech
Neural speech coding aims to compress and reconstruct speech signals using deep learning models, prioritizing high fidelity at low bitrates for efficient communication. Current research emphasizes improving model efficiency (e.g., through smaller architectures like ConvMixers and optimized quantization techniques such as scalar quantization), robustness to noise and packet loss (via methods like GANs and feature-domain packet loss concealment), and personalization for enhanced quality and reduced complexity. These advancements have significant implications for real-time communication systems, enabling high-quality speech transmission in bandwidth-constrained environments and applications like VoIP and low-power devices.
Papers
October 21, 2024
September 18, 2024
September 9, 2024
August 13, 2024
July 30, 2024
June 13, 2024
May 14, 2024
April 30, 2024
April 3, 2024
March 31, 2024
March 11, 2024
February 2, 2024
January 17, 2024
November 14, 2023
October 13, 2023
September 15, 2023
September 14, 2023
May 22, 2023