Streaming Voice Conversion
Streaming voice conversion aims to transform a speaker's voice into another in real-time, overcoming the limitations of non-streaming methods that require processing the entire utterance. Current research focuses on developing efficient streaming architectures, such as those based on Conformers and non-autoregressive transformers, often incorporating techniques like hybrid predictive coding and knowledge distillation to mitigate the lack of future context inherent in streaming processing. These advancements are crucial for real-time applications like voice assistants and live communication systems, improving both the speed and quality of voice conversion.
Papers
September 27, 2023
May 21, 2023
October 27, 2022