Speech Signal

Speech signals are the acoustic representations of spoken language, and research focuses on improving their processing for various applications. Current efforts concentrate on developing robust models for speech enhancement (e.g., using diffusion models and state-space models like Mamba), source separation (leveraging techniques like attention mechanisms and incorporating spatial information), and accurate recognition, even in noisy or challenging environments. These advancements have significant implications for improving human-computer interaction, assistive technologies for individuals with hearing impairments, and applications in healthcare (e.g., disease detection using speech biomarkers) and security (e.g., synthetic speech detection).

Papers