Vocal Tract

The vocal tract, the anatomical structure that shapes the sound produced by the vocal folds, is a central focus of speech science research. Current investigations use diverse approaches, including signal processing models (such as ARMAX-LF) and deep learning architectures (such as diffusion models and neural codecs), to analyze vocal tract dynamics from audio, MRI, and ultrasound data. These efforts aim to improve speech synthesis and recognition and to deepen understanding of speech production mechanisms, with applications ranging from language learning to forensic speech analysis and clinical diagnostics. Large multimodal datasets are crucial to advancing these research goals.

Papers