Low Latency Speech Enhancement
Low-latency speech enhancement aims to improve speech intelligibility in noisy environments while minimizing processing delay, crucial for real-time applications like hearing aids and teleconferencing. Current research emphasizes developing efficient deep learning models, including recurrent and convolutional neural networks, often integrated with techniques like neural Wiener filtering or autoregressive generation, to achieve high-quality enhancement with latencies under 10ms, sometimes even below 5ms. These advancements are significantly impacting fields requiring immediate audio processing, offering improved user experience and enabling new possibilities in assistive technologies and communication systems.
Papers
September 5, 2024
August 14, 2024
April 30, 2024
January 15, 2024
October 13, 2023
June 5, 2023
February 26, 2023
November 3, 2022
April 12, 2022
March 30, 2022