Phoneme Sequence

Phoneme sequence analysis focuses on understanding how sequences of speech sounds are processed, generated, and manipulated in speech and language technologies. Current research emphasizes developing robust models, such as transformer-based architectures and generative transducers, to improve the accuracy and controllability of speech synthesis and recognition, often incorporating techniques like attention mechanisms and label smoothing to mitigate errors. This work is crucial for advancing applications like text-to-speech systems, machine translation for dubbing, and building more robust spoken language understanding systems that are less susceptible to automatic speech recognition errors.

Papers