Speech Deepfakes

Speech deepfakes, synthetic audio generated with machine learning, pose a significant threat to authenticity and security. Current research focuses on developing robust detection methods using diverse approaches: mixture-of-experts models, pre-trained models such as WavLM, and analysis of higher-level features such as breath patterns or prosody, alongside speaker verification techniques. These efforts aim to improve detection accuracy across datasets and deepfake generation methods, so that real and synthetic speech can be reliably distinguished in applications ranging from security to combating misinformation.

Papers