Speaker Diarization
Speaker diarization is the task of determining "who spoke when" in an audio recording, and it is a common preprocessing step for many speech applications. Current research focuses on improving accuracy and efficiency, particularly in challenging conditions such as multi-speaker conversations and noisy environments, using techniques such as end-to-end neural networks, spectral clustering, and the integration of audio-visual or semantic information. These advances support applications such as meeting transcription and multilingual speech processing, and they improve the performance of downstream tasks such as automatic speech recognition.
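To make the "who spoke when" output concrete, here is a minimal sketch of the final step many diarization pipelines share: turning per-frame speaker labels into timed speaker turns. It assumes the labels have already been produced by some upstream method (for example, clustering of speaker embeddings). The function name `frames_to_turns`, the 0.5-second frame length, and the sample labels are all hypothetical and chosen only for illustration.

```python
from typing import List, Tuple


def frames_to_turns(
    labels: List[str], frame_sec: float = 0.5
) -> List[Tuple[float, float, str]]:
    """Merge consecutive frames with the same speaker label into turns.

    Each turn is a (start_seconds, end_seconds, speaker) tuple.
    """
    turns: List[Tuple[float, float, str]] = []
    start = 0  # index of the first frame in the current turn
    for i in range(1, len(labels) + 1):
        # Close the current turn at the end of input or when the label changes.
        if i == len(labels) or labels[i] != labels[start]:
            turns.append((start * frame_sec, i * frame_sec, labels[start]))
            start = i
    return turns


# Hypothetical per-frame labels for a 4-second clip (0.5 s per frame).
labels = ["A", "A", "A", "B", "B", "A", "A", "B"]
for begin, end, speaker in frames_to_turns(labels):
    print(f"{begin:4.1f}-{end:4.1f}s  speaker {speaker}")
```

Real systems typically add smoothing, overlap handling, and a minimum turn duration on top of this; the sketch only shows the shape of the output that downstream tasks such as speech recognition consume.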