Noisy Speech

Noisy speech is a major obstacle to accurate speech processing and degrades applications such as speech recognition, emotion recognition, and voice conversion. Current research focuses on robust models, often built on deep neural networks (DNNs) such as GANs and transformers, that either enhance the noisy signal before downstream processing or model the characteristics of noisy speech directly. These advances matter for the reliability and usability of speech-based technologies in real-world conditions, in fields ranging from assistive technologies for people with hearing impairments to more natural human-computer interaction.

Papers