Discriminative Speaker
Discriminative speaker representation focuses on creating speaker embeddings that effectively distinguish individuals from one another in speech data, crucial for applications like speaker verification and diarization. Current research emphasizes improving the discriminative power of these embeddings through techniques like contrastive learning, often within architectures such as SimCLR and variations of ResNet, and by incorporating attention mechanisms to focus on salient speaker-specific features. These advancements aim to enhance the accuracy and robustness of speaker recognition systems, impacting fields ranging from forensic science to personalized user interfaces.
Papers
August 28, 2024
April 23, 2024
February 11, 2024
September 23, 2023
September 17, 2023
January 22, 2023
October 29, 2022
August 15, 2022
August 1, 2022
July 11, 2022