Target Speaker Extraction
Target speaker extraction (TSE) isolates one specific speaker's voice from a mixture of overlapping speech, a capability needed by applications such as hearing aids and personalized voice interfaces. Current research focuses on robustness and generalization, particularly in noisy or reverberant environments. Work in this area typically builds on transformer and convolutional neural network architectures and often adds curriculum learning and data augmentation during training. Efficient, accurate TSE methods could improve speech processing and human-computer interaction in difficult acoustic conditions.
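To make the terms "overlapping speech mixture" and "data augmentation" concrete, the sketch below builds a two-source mixture at a chosen target-to-interferer ratio and scores it. It is an illustration under stated assumptions, not a method taken from any of the listed papers: the function names `mix_at_sir` and `si_sdr` are hypothetical, white noise stands in for real speech recordings, and scale-invariant SDR is used only as one commonly reported quality measure.

```python
import numpy as np


def mix_at_sir(target, interferer, sir_db):
    """Add `interferer` to `target` after scaling it to a given power ratio.

    Hypothetical helper: `sir_db` is the target-to-interferer power ratio
    in dB. Returns the mixture and the scaled interferer.
    """
    n = min(len(target), len(interferer))
    target, interferer = target[:n], interferer[:n]
    p_target = np.mean(target ** 2)
    p_interf = np.mean(interferer ** 2)
    # Choose gain so that p_target / (gain**2 * p_interf) == 10**(sir_db / 10).
    gain = np.sqrt(p_target / (p_interf * 10 ** (sir_db / 10)))
    scaled = gain * interferer
    return target + scaled, scaled


def si_sdr(estimate, reference):
    """Scale-invariant signal-to-distortion ratio in dB."""
    reference = reference - reference.mean()
    estimate = estimate - estimate.mean()
    # Project the estimate onto the reference to remove any scale mismatch.
    alpha = np.dot(estimate, reference) / np.dot(reference, reference)
    projection = alpha * reference
    residual = estimate - projection
    return 10 * np.log10(np.sum(projection ** 2) / np.sum(residual ** 2))


# Assumption: random noise as a stand-in for one second of 16 kHz speech.
rng = np.random.default_rng(0)
target = rng.standard_normal(16000)
interferer = rng.standard_normal(16000)

mixture, scaled = mix_at_sir(target, interferer, sir_db=5.0)
print("SI-SDR of the unprocessed mixture (dB):", si_sdr(mixture, target))
```

Sampling `sir_db` from a range for each training example is one simple form of augmentation. A TSE model would then be expected to return an estimate whose SI-SDR against `target` is higher than that of the unprocessed mixture.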