Speaker Similarity
Speaker similarity research studies how to represent and manipulate speaker characteristics in speech signals, chiefly to improve speech separation, voice conversion, and text-to-speech (TTS) systems. Current work emphasizes robust models, including transformer-based architectures, normalizing flows, and diffusion models, that remain less sensitive to variations in pitch and other speaker-specific features, even when training data is limited. These advances matter most in applications such as multi-speaker speech recognition, personalized TTS, and voice cloning, where a system must identify speakers accurately and tell them apart.
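As a rough illustration of what "similarity" can mean in practice, one common convention is to compare fixed-length speaker embeddings with cosine similarity. The sketch below assumes that convention; the summary above does not prescribe a metric. The vectors are toy values invented for the example, not output from a real speaker encoder, and the function and variable names are hypothetical.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Toy 4-dimensional vectors standing in for real speaker embeddings
# (assumption: a real system would obtain these from a trained encoder).
reference = [0.9, 0.1, 0.3, 0.2]
same_speaker = [0.8, 0.2, 0.35, 0.15]
other_speaker = [-0.1, 0.9, -0.4, 0.3]

sim_same = cosine_similarity(reference, same_speaker)
sim_other = cosine_similarity(reference, other_speaker)

print(f"reference vs. same speaker:  {sim_same:.3f}")
print(f"reference vs. other speaker: {sim_other:.3f}")
```

With these toy values the first score comes out close to 1 and the second close to 0. In a voice conversion or voice cloning evaluation, a higher score between the generated speech and the target speaker's reference would be read as closer speaker identity, under the assumption that the embedding captures it.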