Disfluency Detection

Disfluency detection identifies interruptions in the flow of speech, such as repetitions or hesitations, so that they can be corrected or removed, which improves the accuracy and efficiency of speech processing systems. Current research emphasizes multimodal approaches that combine acoustic and visual data with architectures such as transformers and graph convolutional networks to improve detection accuracy, and it addresses data scarcity through techniques such as synthetic data generation. This work matters for automatic speech recognition, conversational AI, and speech therapy applications, because accurate disfluency detection supports better natural language understanding and more effective human-computer interaction.
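As a rough illustration of synthetic data generation for this task, the sketch below injects filled pauses and word repetitions into a fluent token sequence and records a per-token label. It is a minimal example under stated assumptions, not the method of any particular paper: the filler inventory, the probabilities, and the function names `inject_disfluencies` and `strip_disfluencies` are all hypothetical and chosen only for illustration.

```python
import random

# Assumed filled-pause inventory; real corpora use richer, language-specific sets.
FILLERS = ["uh", "um"]


def inject_disfluencies(tokens, p_repeat=0.2, p_filler=0.2, seed=0):
    """Return (noisy_tokens, labels) where label 1 marks an inserted disfluent token.

    Hypothetical helper: before each fluent token, optionally insert a filler
    and/or an extra copy of the token (a one-word repetition).
    """
    rng = random.Random(seed)  # local RNG so results are reproducible
    noisy, labels = [], []
    for tok in tokens:
        if rng.random() < p_filler:
            noisy.append(rng.choice(FILLERS))
            labels.append(1)
        if rng.random() < p_repeat:
            noisy.append(tok)  # the first copy is treated as the disfluent one
            labels.append(1)
        noisy.append(tok)  # the original fluent token is always kept
        labels.append(0)
    return noisy, labels


def strip_disfluencies(noisy, labels):
    """Recover the fluent sequence by dropping every token labelled 1."""
    return [tok for tok, label in zip(noisy, labels) if label == 0]


if __name__ == "__main__":
    fluent = "i want a flight to boston".split()
    noisy, labels = inject_disfluencies(fluent, seed=1)
    print(list(zip(noisy, labels)))
    print(strip_disfluencies(noisy, labels))
```

Pairs of `(noisy, labels)` produced this way could serve as training data for a token-level tagger; removing every token labelled 1 recovers the original fluent sentence by construction.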

Papers