Robust Speaker Representation
Robust speaker representation focuses on creating speech embeddings that are resilient to noise, variations in speaking style, and differences in language or recording conditions, enabling accurate speaker identification and verification across diverse scenarios. Current research emphasizes self-supervised learning methods, often employing architectures like HuBERT and variations thereof, along with techniques like disentanglement learning and data augmentation to improve model robustness. These advancements are crucial for improving the accuracy and reliability of various speech technologies, including speaker verification systems, speech recognition, and emotion recognition, particularly in challenging real-world conditions.
Papers
June 30, 2024
June 17, 2024
June 9, 2024
June 4, 2024
November 27, 2023
November 4, 2023
September 14, 2023
September 5, 2023
August 5, 2023
November 1, 2022
October 28, 2022