Articulatory Inversion
Articulatory inversion aims to reconstruct the movements of speech articulators (tongue, lips, etc.) from audio recordings, bridging the gap between acoustic signals and the physical mechanisms of speech production. Current research heavily utilizes deep learning models, often incorporating self-supervised learning and multi-channel attention mechanisms to improve speaker-independent performance and handle diverse speech characteristics, including dysarthric speech. This field is significant for advancing our understanding of speech production, enabling applications such as realistic avatar animation, improved speech synthesis, and potentially assisting in the diagnosis and treatment of speech disorders.
Papers
July 3, 2024
June 25, 2024
March 9, 2024
October 25, 2023
October 16, 2023
September 17, 2023
September 3, 2023
June 1, 2023
February 26, 2023
February 14, 2023
October 30, 2022
October 29, 2022
April 2, 2022
March 11, 2022