Disfluency Generation
Disfluency generation is the task of producing artificial text or speech that contains the hesitations, filled pauses, repetitions, and self-repairs that occur naturally in human conversation. Current research emphasizes using large language models to generate realistic disfluent text, typically through prompt engineering and data augmentation, because existing disfluency datasets tend to be small and imbalanced. This work matters for improving speech synthesis, for training automatic speech recognition and disfluency detection systems, and for providing resources to study conditions such as stuttering and Alzheimer's disease, where disfluencies can serve as diagnostic indicators.
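As a rough illustration of the data augmentation idea, the sketch below injects filled pauses and word repetitions into fluent text and tags each token, which is one simple way to produce labeled examples for a disfluency detector. It is a rule-based stand-in, not the LLM-based approach described above, and the function name `inject_disfluencies`, the filler inventory, the probabilities, and the tag set (`F`, `R`, `O`) are all assumptions made for this example.

```python
import random

# Assumed filler inventory; real corpora use a wider, language-specific set.
FILLERS = ["uh", "um", "er"]


def inject_disfluencies(sentence, p_filler=0.15, p_repeat=0.15, seed=None):
    """Return (token, tag) pairs for a synthetically disfluent sentence.

    Tags (hypothetical scheme): "F" = inserted filler, "R" = repeated word,
    "O" = original fluent token.
    """
    rng = random.Random(seed)  # local generator so results are reproducible
    tagged = []
    for word in sentence.split():
        # Optionally insert a filled pause before the word.
        if rng.random() < p_filler:
            tagged.append((rng.choice(FILLERS), "F"))
        # Optionally repeat the word once before its fluent occurrence.
        if rng.random() < p_repeat:
            tagged.append((word, "R"))
        tagged.append((word, "O"))
    return tagged


if __name__ == "__main__":
    pairs = inject_disfluencies("i want to book a flight", seed=7)
    print(" ".join(token for token, _ in pairs))
    print(" ".join(tag for _, tag in pairs))
```

Dropping every token not tagged `O` recovers the original sentence, so each generated example doubles as a fluent/disfluent pair. Independent per-word insertion is a deliberate simplification: natural disfluencies cluster at phrase boundaries and include self-repairs, which this sketch does not model.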