Speech Enhancement

Speech enhancement aims to improve the clarity and intelligibility of speech signals degraded by noise and reverberation, crucial for applications like hearing aids and voice assistants. Current research focuses on developing computationally efficient models, including lightweight convolutional neural networks, recurrent neural networks (like LSTMs), and diffusion models, often incorporating techniques like multi-channel processing, attention mechanisms, and self-supervised learning to achieve high performance with minimal latency. These advancements are driving progress towards more robust and resource-efficient speech enhancement systems for a wide range of real-world applications, particularly in low-power devices and challenging acoustic environments. The field also explores the integration of visual information and advanced signal processing techniques to further enhance performance.

341papers

Papers - Page 13

November 10, 2022

Speech Enhancement with Fullband-Subband Cross-Attention Network
Speech Enhancement

November 8, 2022

DiffPhase: Generative Diffusion-based STFT Phase Retrieval
Diffusion Based Speech Enhancement Generative Question Speech Enhancement Phase Retrieval

November 5, 2022

Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation
Universal Speech Enhancement Speech Enhancement Copy Suppression Cross Task Knowledge Distillation Personalized Voice Activity Detection

November 4, 2022

October 31, 2022

Diffusion-based Generative Speech Source Separation
Diffusion Model Speech Enhancement Score Based Generative Source Separation Channel Source Separation

October 30, 2022

SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Speech Enhancement Time Domain Speech Enhancement Generative Model Audio Synthesis Iterative Refinement

October 28, 2022

Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Speech Enhancement Neural Speech Enhancement Source Filter Homomorphic Logistic Regression Training

October 27, 2022

October 26, 2022

Speech Enhancement

Papers - Page 13

Speech Enhancement with Fullband-Subband Cross-Attention Network

DiffPhase: Generative Diffusion-based STFT Phase Retrieval

Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation

Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation

Speech enhancement using ego-noise references with a microphone array embedded in an unmanned aerial vehicle

Self-Supervised Learning for Speech Enhancement through Synthesis

Cold Diffusion for Speech Enhancement

Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration

Analysis of Noisy-target Training for DNN-based speech enhancement

Inference and Denoise: Causal Inference-based Neural Speech Enhancement

Fast and efficient speech enhancement with variational autoencoders

A weighted-variance variational autoencoder model for speech enhancement

Diffusion-based Generative Speech Source Separation

SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement

Speech Enhancement with Intelligent Neural Homomorphic Synthesis

A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech

Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement

Speaker Diarization Based on Multi-channel Microphone Array in Small-scale Meeting

Parallel Gated Neural Network With Attention Mechanism For Speech Enhancement

SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks