Target Speaker Extraction
Target speaker extraction (TSE) aims to isolate one specific speaker's voice from a mixture of overlapping speech, a crucial task for applications like hearing aids and personalized voice interfaces. Current research emphasizes robustness and generalization. It focuses on model architectures such as transformers and convolutional neural networks, often combined with curriculum learning and data augmentation to improve performance, particularly in noisy or reverberant environments. Efficient and accurate TSE methods could advance speech processing and improve human-computer interaction in challenging acoustic scenarios.
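To make the data augmentation and curriculum learning ideas mentioned above more concrete, here is a minimal sketch of how overlapping-speech training mixtures might be built. It is an illustration only, not the method of any particular paper: the function names (`mix_at_snr`, `curriculum_snrs`) are hypothetical, the SNR range of 10 dB down to -5 dB is an assumed example, and signals are represented as plain lists of float samples rather than real audio.

```python
import math
import random


def power(signal):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(target, interferer, snr_db):
    """Sum target and interferer so the target-to-interferer power ratio is snr_db.

    Both inputs are truncated to the shorter length. Only the interferer is
    rescaled, so the target stays usable as the training reference.
    """
    n = min(len(target), len(interferer))
    if n == 0:
        raise ValueError("signals must be non-empty")
    target, interferer = list(target[:n]), list(interferer[:n])
    p_target, p_interf = power(target), power(interferer)
    if p_target == 0 or p_interf == 0:
        raise ValueError("signals must have non-zero power")
    # Gain g such that p_target / (g^2 * p_interf) == 10^(snr_db / 10).
    gain = math.sqrt(p_target / (p_interf * 10 ** (snr_db / 10)))
    return [t + gain * i for t, i in zip(target, interferer)]


def curriculum_snrs(num_stages, easy_db=10.0, hard_db=-5.0):
    """Evenly spaced SNRs from easy (high SNR) to hard (low SNR) stages.

    The default endpoints are assumed values chosen for illustration.
    """
    if num_stages < 1:
        raise ValueError("num_stages must be at least 1")
    if num_stages == 1:
        return [easy_db]
    step = (hard_db - easy_db) / (num_stages - 1)
    return [easy_db + k * step for k in range(num_stages)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Random noise stands in for real target and interfering speech here.
    target = [rng.gauss(0.0, 1.0) for _ in range(16000)]
    interferer = [rng.gauss(0.0, 1.0) for _ in range(16000)]
    for snr_db in curriculum_snrs(4):
        mixture = mix_at_snr(target, interferer, snr_db)
        print(f"stage SNR {snr_db:+.1f} dB, mixture length {len(mixture)}")
```

In a setup like this, a model would see the high-SNR mixtures first and the harder low-SNR mixtures later. Real pipelines typically also add background noise and reverberation, which this sketch leaves out.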