Synthetic Speech Detector
Synthetic speech detection aims to distinguish artificially generated speech from human speech, addressing the growing concern of malicious use of realistic AI-generated audio. Current research focuses on developing robust detectors using deep learning architectures like Transformers and convolutional neural networks, often incorporating techniques such as attention mechanisms and feature fusion to improve accuracy and generalization across diverse datasets and speech synthesis methods. This field is crucial for combating audio deepfakes and misinformation, with ongoing efforts concentrating on improving detector robustness to compression, noise, and various synthesis techniques, as well as mitigating biases in detection algorithms.
Papers
October 9, 2024
October 6, 2024
September 19, 2024
August 26, 2024
June 25, 2024
April 17, 2024
February 22, 2024
February 8, 2024
October 21, 2022
October 6, 2022
September 15, 2022