Speaker Separation
Speaker separation aims to isolate individual voices from a mixture of sounds, a task that matters for applications such as speech recognition in noisy environments and virtual meetings. Current research focuses on robust deep learning models, including attention-based neural networks and models that combine audio and visual information, that can handle multiple speakers, reverberation, and missing data. These models draw on techniques such as complex spectral mapping, spatial activity analysis, and speaker embeddings to improve separation accuracy and efficiency, with applications ranging from assistive hearing technologies to music information retrieval.
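To make the idea of separating a mixture in the spectral domain concrete, here is a minimal toy sketch. It is not one of the learned models described above: it swaps in an oracle binary mask, which cheats by looking at the reference sources to decide which frequency bin belongs to which "speaker", whereas a trained model has to predict that assignment (or the complex spectrum itself) from the mixture alone. The two "speakers" are pure tones, and all function names (`dft`, `idft`, `separate_with_oracle_mask`) are hypothetical, chosen only for this illustration.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform, O(n^2); fine for a toy signal."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spec):
    """Inverse of dft(); keeps only the real part of each sample."""
    n = len(spec)
    return [
        (sum(spec[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def separate_with_oracle_mask(mixture, sources):
    """Give each frequency bin of the mixture to the loudest reference source.

    Assumption: the reference sources are known (oracle setting), which is
    never true in practice; a real system must estimate the mask.
    """
    mix_spec = dft(mixture)
    src_specs = [dft(s) for s in sources]
    n = len(mix_spec)
    # For every bin, the index of the source with the largest magnitude.
    owner = [
        max(range(len(src_specs)), key=lambda j: abs(src_specs[j][k]))
        for k in range(n)
    ]
    estimates = []
    for i in range(len(sources)):
        masked = [mix_spec[k] if owner[k] == i else 0j for k in range(n)]
        estimates.append(idft(masked))
    return estimates


# Toy data: two "speakers" as tones that fall exactly on different DFT bins.
N = 64
speaker_a = [math.sin(2 * math.pi * 4 * t / N) for t in range(N)]
speaker_b = [0.5 * math.sin(2 * math.pi * 10 * t / N) for t in range(N)]
mixture = [a + b for a, b in zip(speaker_a, speaker_b)]

estimates = separate_with_oracle_mask(mixture, [speaker_a, speaker_b])

# Largest sample-wise reconstruction error across both estimates.
max_error = max(
    abs(est - ref)
    for est_sig, ref_sig in zip(estimates, [speaker_a, speaker_b])
    for est, ref in zip(est_sig, ref_sig)
)
```

On this input the two tones occupy disjoint frequency bins, so the masked reconstructions match the references up to floating-point error. Real speech overlaps heavily in time and frequency, which is why the approaches above work on short-time spectra and learn the mask or the complex spectrum rather than relying on an oracle.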