Speech Enhancement
Speech enhancement aims to improve the clarity and intelligibility of speech signals degraded by noise and reverberation, which is crucial for applications such as hearing aids and voice assistants. Current research focuses on computationally efficient models, including lightweight convolutional neural networks, recurrent neural networks (such as LSTMs), and diffusion models, often combined with multi-channel processing, attention mechanisms, and self-supervised learning to achieve high performance with minimal latency. These advances are driving progress toward more robust and resource-efficient speech enhancement systems for real-world use, particularly on low-power devices and in challenging acoustic environments. The field also explores integrating visual information and advanced signal processing techniques to improve performance further.
Papers
Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation
Speech enhancement using ego-noise references with a microphone array embedded in an unmanned aerial vehicle
Self-Supervised Learning for Speech Enhancement through Synthesis
Cold Diffusion for Speech Enhancement
Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration