Acoustic Modeling
Acoustic modeling focuses on representing and manipulating audio signals, primarily for speech and music processing tasks such as speech recognition, text-to-speech synthesis, and music generation. Current research emphasizes robust deep neural network models, including transformer-based architectures, normalizing flows, and diffusion models, often combined with self-supervised learning and contextual information (e.g., dialogue history or simulated future frames) to improve accuracy and efficiency. These advances are yielding more accurate and efficient speech recognition systems and more natural-sounding synthetic speech and music.