Speaker Extraction
Speaker extraction aims to isolate a target speaker's voice from a mixture of sounds, a crucial task with applications in enhancing speech intelligibility and enabling more robust speech processing systems. Current research focuses on developing sophisticated deep learning models, often employing attention mechanisms and incorporating multi-scale or multi-modal information (audio-visual, spatial cues) to improve accuracy and robustness in challenging acoustic environments. These advancements are driving progress in areas like personalized acoustic echo cancellation and improving the performance of downstream tasks such as speech recognition and diarization.
Papers
September 4, 2024
June 12, 2024
April 29, 2024
October 7, 2023
September 19, 2023
June 28, 2023
June 5, 2023
March 14, 2023
March 13, 2023
March 9, 2023
January 31, 2023
December 14, 2022
November 4, 2022
October 31, 2022
October 9, 2022
July 10, 2022
April 15, 2022
April 4, 2022