Speech Processing System

Speech processing systems aim to enable computers to understand and generate human speech, encompassing tasks like speech recognition, synthesis, and understanding. Current research emphasizes improving model efficiency (e.g., through linear-complexity alternatives to self-attention), robustness to noise and real-world conditions (including leveraging weakly supervised learning and adapting to noisy data), and expanding capabilities to diverse languages and applications (such as analyzing children's speech development and detecting speaker changes). These advancements are crucial for enhancing accessibility, improving human-computer interaction, and creating more accurate and reliable applications across various fields, from healthcare to assistive technologies.

Papers