Refined Diarization
Refined diarization aims to accurately segment and label audio recordings by speaker, improving upon the limitations of traditional methods. Current research emphasizes developing robust models that handle diverse acoustic conditions, including overlapping speech, multiple languages, and distant microphones, often employing neural networks, particularly end-to-end architectures and those incorporating large language models for post-processing. These advancements are crucial for improving the accuracy of automatic speech recognition and transcription in various applications, such as healthcare, meeting transcription, and media analysis, ultimately reducing manual effort and improving accessibility.
Papers
November 11, 2024
November 4, 2024
October 4, 2024
September 20, 2024
July 23, 2024
June 20, 2024
June 13, 2024
June 12, 2024
January 7, 2024
November 21, 2023
September 22, 2023
September 19, 2023
August 21, 2023
May 25, 2023
March 1, 2023
November 12, 2022
August 5, 2022
July 28, 2022
March 30, 2022