Speech Bandwidth Extension

Speech bandwidth extension (BWE) aims to enhance the quality and intelligibility of narrowband speech by artificially reconstructing higher frequencies. Recent research focuses on developing efficient and high-quality BWE models using neural networks, particularly generative adversarial networks (GANs) and diffusion models, often incorporating parallel processing of amplitude and phase information to improve speed and accuracy. These advancements are improving speech quality metrics and showing potential benefits in applications like speaker verification, where BWE can boost performance without introducing significant artifacts. The development of flexible, real-time capable BWE algorithms is a key area of ongoing investigation.

Papers