Neural Speech Enhancement
Neural speech enhancement aims to improve the quality and intelligibility of speech signals degraded by noise and reverberation. Current research focuses on developing computationally efficient models, such as LSTM-based networks and autoencoders, often incorporating multi-modal data (audio-visual) or leveraging techniques like latent diffusion and multi-band processing for improved performance. These advancements are significant for applications ranging from hearing aids and voice assistants to robust speech recognition in challenging acoustic environments, driving improvements in both objective metrics and subjective listening experiences.
Papers
June 14, 2022
April 21, 2022
March 4, 2022
February 24, 2022
November 3, 2021