Speaker Discriminability
Speaker discriminability focuses on developing methods to reliably distinguish between different speakers in audio recordings, a crucial task for applications like speaker verification and diarization. Current research emphasizes improving the robustness of speaker embeddings, often leveraging contrastive learning and novel loss functions to enhance the separability of speaker representations in high-dimensional spaces, with architectures like ECAPA-TDNN and SepFormer being prominent. These advancements aim to improve the accuracy and reliability of speaker identification and separation in challenging conditions, such as overlapping speech or variations in speaking style, impacting fields ranging from security to meeting transcription.
Papers
January 16, 2023
November 18, 2022
October 29, 2022
June 28, 2022
June 16, 2022