Speaker Change
Speaker change detection (SCD) identifies the points in an audio recording where one speaker stops and another begins, a key step for applications such as automatic speech recognition and transcription. Current research aims to improve SCD accuracy with deep learning models, including transformer-transducer architectures and models that combine speaker-specific and content-based information, often using self-supervised learning. These advances are improving real-time speech processing in multi-speaker settings such as meetings and broadcast media, and they also inform related tasks such as spoken language change detection.
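To make the task concrete, here is a minimal sketch of a classical distance-based baseline, not the transformer-transducer or self-supervised models mentioned above. It assumes per-frame speaker embeddings have already been produced by some encoder (not shown), and it flags a change wherever the cosine distance between the mean embeddings of two adjacent windows reaches a local peak above a threshold. The function names, the `window` size, and the `threshold` value are all hypothetical choices for illustration.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine_distance(a, b):
    """Return 1 - cosine similarity; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return 1.0 - dot / (norm_a * norm_b)


def detect_speaker_changes(embeddings, window=3, threshold=0.5):
    """Return frame indices t where a speaker change is hypothesized.

    A change at index t means the boundary lies just before frame t.
    Assumption: `embeddings` is a list of per-frame speaker embeddings
    from some upstream encoder; `window` and `threshold` are
    illustrative values, not tuned settings.
    """
    # Score every boundary that has a full window on each side.
    scores = {}
    for t in range(window, len(embeddings) - window + 1):
        left = mean_vector(embeddings[t - window:t])
        right = mean_vector(embeddings[t:t + window])
        scores[t] = cosine_distance(left, right)

    # Keep only local peaks at or above the threshold.
    changes = []
    for t, score in scores.items():
        if score < threshold:
            continue
        if score >= scores.get(t - 1, 0.0) and score > scores.get(t + 1, 0.0):
            changes.append(t)
    return changes


if __name__ == "__main__":
    # Toy input: five frames of "speaker A" followed by five of "speaker B".
    toy = [[1.0, 0.0]] * 5 + [[0.0, 1.0]] * 5
    print(detect_speaker_changes(toy))
```

The local-peak test keeps a single boundary per transition, since windows that straddle the change also produce elevated distances on either side of it. Learned SCD models replace both the hand-set threshold and the fixed windows with trained components.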