Cocktail Party
The "cocktail party effect," the ability to focus on a single conversation amidst background noise, is a central challenge in speech separation and target speaker extraction. Current research focuses on leveraging large language models (LLMs) to guide speech separation, using textual cues to identify target speakers and improve robustness, and developing new datasets with ground truth labels to better evaluate these models. This work is significant for advancing human-computer interaction, improving assistive technologies like hearing aids, and enhancing privacy-preserving techniques in federated learning where protecting individual data during model training is crucial.
Papers
October 22, 2024
October 13, 2024
September 27, 2024
August 13, 2024
May 26, 2024
March 5, 2024
February 27, 2024
January 23, 2024
October 11, 2023
June 1, 2023
May 25, 2023
March 24, 2023
February 7, 2023
December 1, 2022
November 12, 2022
September 12, 2022
August 15, 2022
March 31, 2022
December 17, 2021