Synthetic Voice
Synthetic voice generation, aiming to create realistic artificial speech, is rapidly advancing, driven by deep learning techniques and models like WaveNet, Tacotron, and Transformer-based architectures. Current research focuses on improving the naturalness and expressiveness of synthetic voices, including emotional nuance and accurate representation of diverse accents and speakers, while simultaneously developing robust detection methods to counter the potential misuse of this technology in deepfakes and other malicious applications. The ability to both generate highly realistic synthetic speech and reliably detect it has significant implications for security, forensics, accessibility, and the entertainment industry.
Papers
October 9, 2024
August 30, 2024
July 24, 2024
July 11, 2024
July 7, 2024
June 11, 2024
May 11, 2024
April 7, 2024
March 17, 2024
March 4, 2024
January 25, 2024
January 17, 2024
January 8, 2024
January 4, 2024
December 28, 2023
December 22, 2023
October 22, 2023
October 8, 2023
September 25, 2023
September 15, 2023