Speaker Recognition System
Speaker recognition systems aim to automatically identify individuals based on their voice, with applications ranging from security systems to virtual assistants. Current research heavily focuses on improving robustness against adversarial attacks (e.g., voice morphing, spoofing) and addressing biases stemming from training data limitations, often employing deep neural networks like ResNets, ECAPA-TDNN, and transformer-based models such as WavLM. These advancements are crucial for enhancing the reliability and security of voice-based biometric systems, impacting fields like forensics, healthcare, and personal security.
Papers
Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Guangke Chen, Zhe Zhao, Fu Song, Sen Chen, Lingling Fan, Feng Wang, Jiashui Wang
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems
Guangke Chen, Zhe Zhao, Fu Song, Sen Chen, Lingling Fan, Yang Liu