Spontaneous Speech

Spontaneous speech research focuses on understanding and modeling the complexities of naturally occurring conversation, aiming to improve automatic speech recognition (ASR) and applications like Alzheimer's disease detection. Current research employs diverse machine learning models, including transformers, convolutional neural networks, and recurrent neural networks, often incorporating multimodal data (audio and text) and advanced techniques like attention mechanisms and data augmentation to enhance performance. This field is significant for advancing ASR technology across multiple languages and for developing reliable diagnostic tools for neurological disorders, leveraging the rich information embedded within spontaneous speech patterns.

Papers