Synthetic Speech Detector

Synthetic speech detection aims to distinguish artificially generated speech from human speech, in response to growing concern over the malicious use of realistic AI-generated audio. Current research focuses on building robust detectors with deep learning architectures such as Transformers and convolutional neural networks, often adding attention mechanisms and feature fusion to improve accuracy and generalization across datasets and speech synthesis methods. The field is central to combating audio deepfakes and misinformation. Ongoing work targets detector robustness to compression, noise, and varied synthesis techniques, as well as the mitigation of bias in detection algorithms.

Papers

December 26, 2024

Improving Generalization for AI-Synthesized Voice Detection
Hainan Ren, Lin Li, Chun-Hao Liu, Xin Wang, Shu Hu
Speaker Identity Domain Invariant Representation Modern Vocoders Enhancing Generalization Synthetic Speech Detector

December 17, 2024

Synthetic Speech Classification: IEEE Signal Processing Cup 2022 challenge
Mahieyin Rahmun, Rafat Hasan Khan, Tanjim Taharat Aurpa, Sadia Khan, Zulker Nayeen Nahiyan, Mir Sayad Bin Almas, Rakibul Hasan Rajib, Syeda Sakira Hassan
Challenge Task Gaussian Mixture Model Signal Processing Synthetic Speech Detector

October 9, 2024

Can DeepFake Speech be Reliably Detected?
Hongbin Liu, Youzheng Chen, Arun Narayanan, Athula Balachandran, Pedro J. Moreno, Lun Wang
Synthesized Speech Voice Cloning Adversarial Threat Synthetic Speech Detector Speech Deepfakes

October 6, 2024

SONAR: A Synthetic AI-Audio Detection Framework and Benchmark
Xiang Li, Pin-Yu Chen, Wenqi Wei
Text to Speech Low Cost Obstacle Avoidance Sonar Audio Detection Synthetic Speech Detector

September 19, 2024

DiffSSD: A Diffusion-Based Dataset For Speech Forensics
Kratika Bhagtani, Amit Kumar Singh Yadav, Paolo Bestagini, Edward J. Delp
Synthesized Speech Synthetic Speech Detector

August 26, 2024

SONICS: Synthetic Or Not -- Identifying Counterfeit Songs
Md Awsafur Rahman, Zaber Ibn Abdul Hakim, Najibul Haque Sarker, Bishmoy Paul, Shaikh Anowarul Fattah
Large Scale Synthetic Fake Audio Detection Synthetic Speech Detector

June 25, 2024

Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng
Synthesized Speech Multi Head Self Attention Channel Wise Synthetic Speech Detection Synthetic Speech Detector

April 17, 2024

FairSSD: Understanding Bias in Synthetic Speech Detectors
Amit Kumar Singh Yadav, Kratika Bhagtani, Davide Salvi, Paolo Bestagini, Edward J. Delp
Absolute Stance Bias Synthesized Speech Speech Signal Human Speech Language Disorder Synthetic Speech Detector

February 22, 2024

Compression Robust Synthetic Speech Detection Using Patched Spectrogram Transformer
Amit Kumar Singh Yadav, Ziyue Xiang, Kratika Bhagtani, Paolo Bestagini, Stefano Tubaro, Edward J. Delp
Synthesized Speech Audio Spectrogram Transformer Synthetic Speech Detection Synthetic Speech Detector

February 8, 2024

Listening Between the Lines: Synthetic Speech Detection Disregarding Verbal Content
Davide Salvi, Temesgen Semu Balcha, Paolo Bestagini, Stefano Tubaro
Synthesized Speech Best Fit Line Synthetic Speech Detection Verbal Communication Synthetic Speech Detector Audio Forensics

October 21, 2022

Adaptive re-calibration of channel-wise features for Adversarial Audio Classification
Vardhan Dongre, Abhinav Thimma Reddy, Nikhitha Reddeddy
Synthesized Speech Deepfake Audio Channel Wise Synthetic Speech Detection Adversarial Audio Synthetic Speech Detector Adaptive Calibration

October 6, 2022

The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection
Daniele Mari, Federica Latora, Simone Milani
High Efficiency Speech Synthesis Sound Design Audio Processing Robust Detection Synthetic Speech Detection Single Channel Audio Synthetic Speech Detector First Digit

September 15, 2022

Detecting Synthetic Speech Manipulation in Real Audio Recordings
Md Hafizur Rahman, Martin Graciarena, Diego Castan, Chris Cobo-Kroenke, Mitchell McLaren, Aaron Lawson
Deep Fake Synthesized Speech Speech Generation Audio Recording Synthetic Speech Detection Synthetic Speech Detector

May 16, 2022

Transferability of Adversarial Attacks on Synthetic Speech Detection
Jiacheng Deng, Shunyi Chen, Li Dong, Diqun Yan, Rangding Wang
Adversarial Attack Task Transferability Synthetic Speech Detection Synthetic Speech Detector