Speaker Verification
Speaker verification (SV) automatically authenticates a person's claimed identity from their voice, with the goal of remaining accurate and robust across speakers and recording conditions. Current research emphasizes improving the discriminative power of speaker embeddings through techniques like contrastive learning, disentangling confounding factors such as age and channel variations, and leveraging powerful pre-trained models such as WavLM and Whisper. These advances matter for security applications ranging from access control to forensic investigation, and ongoing work targets robustness against spoofing attacks and noisy conditions.
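As a rough illustration of how speaker embeddings are typically used at verification time, the sketch below scores an enrolled embedding against a test embedding with cosine similarity and accepts the identity claim when the score clears a threshold. This is a generic, commonly used scoring scheme, not the method of any paper listed here. The function names, the toy 3-dimensional vectors, and the 0.5 threshold are all assumptions made for illustration; real systems use embeddings with hundreds of dimensions and tune the threshold on held-out trials.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def verify(enrolled, test, threshold=0.5):
    """Return (accepted, score). The threshold value is hypothetical."""
    score = cosine_similarity(enrolled, test)
    return score >= threshold, score


# Toy embeddings (made up for illustration, not model outputs).
enrolled_speaker = [0.9, 0.1, 0.3]
same_speaker_utterance = [0.8, 0.2, 0.25]
other_speaker_utterance = [-0.2, 0.9, -0.4]

accepted_same, score_same = verify(enrolled_speaker, same_speaker_utterance)
accepted_other, score_other = verify(enrolled_speaker, other_speaker_utterance)
```

In this toy setup the same-speaker pair points in nearly the same direction and is accepted, while the other-speaker pair has a negative score and is rejected. Raising the threshold trades fewer false accepts for more false rejects.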
Papers
Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks
Orchid Chetia Phukan, Devyani Koshal, Swarup Ranjan Behera, Arun Balaji Buduru, Rajesh Sharma
Guided Speaker Embedding
Shota Horiguchi, Takafumi Moriya, Atsushi Ando, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix