Fake Audio Detection
Fake audio detection research aims to develop robust methods for identifying synthetic or manipulated audio, combating the spread of misinformation and protecting against security threats. Current efforts focus on improving the generalization of detection models across diverse datasets and spoofing techniques, employing architectures like transformers, convolutional neural networks, and mask autoencoders, often leveraging pre-trained models for efficient feature extraction. This field is crucial for safeguarding audio authenticity in various applications, from speaker verification and forensic analysis to combating deepfakes and protecting against malicious audio manipulation.
Papers
An Initial Investigation for Detecting Vocoder Fingerprints of Fake Audio
Xinrui Yan, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Haoxin Ma, Tao Wang, Shiming Wang, Ruibo Fu
Fully Automated End-to-End Fake Audio Detection
Chenglong Wang, Jiangyan Yi, Jianhua Tao, Haiyang Sun, Xun Chen, Zhengkun Tian, Haoxin Ma, Cunhang Fan, Ruibo Fu