Speech Waveform
Speech waveform research focuses on understanding and manipulating the raw audio signal of speech, aiming to improve speech processing technologies. Current research emphasizes using deep learning models, such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), transformers, and generative adversarial networks (GANs), often applied directly to raw waveforms without intermediate feature extraction, to achieve tasks like speech synthesis, recognition, and enhancement. These advancements have significant implications for applications ranging from improved hearing aids and voice assistants to more accurate forensic speaker identification and the development of more natural-sounding synthetic speech.
Papers
November 11, 2024
June 12, 2024
June 2, 2024
May 5, 2024
February 27, 2024
October 2, 2023
September 26, 2023
September 13, 2023
August 8, 2023
June 1, 2023
May 13, 2023
February 14, 2023
December 8, 2022
October 17, 2022
June 27, 2022
March 29, 2022
March 21, 2022
March 16, 2022
February 7, 2022