Speaker Similarity
Speaker similarity research focuses on accurately representing and manipulating speaker characteristics in speech signals, primarily to improve speech separation, voice conversion, and text-to-speech (TTS) systems. Current work emphasizes robust models, such as those based on transformers, normalizing flows, and diffusion models, that remain less sensitive to variations in pitch and other speaker-specific features, even with limited training data. These advances matter for speech technologies such as multi-speaker speech recognition, personalized TTS, and voice cloning, where accurately identifying and distinguishing speakers is essential.
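As a rough illustration of what "representing speaker characteristics" can mean in practice, speaker similarity is often scored as the cosine similarity between fixed-length speaker embeddings. The sketch below assumes such embeddings have already been produced by some speaker encoder; the function name `cosine_similarity` and the four-dimensional vectors are hypothetical toy values for illustration, not the output of any particular model or the method of any paper listed here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings (real speaker encoders typically emit
# vectors with hundreds of dimensions).
reference = [0.9, 0.1, 0.3, 0.2]          # target speaker
same_speaker = [0.8, 0.2, 0.35, 0.15]     # another utterance, same speaker
other_speaker = [-0.2, 0.9, -0.4, 0.1]    # a different speaker

score_same = cosine_similarity(reference, same_speaker)
score_other = cosine_similarity(reference, other_speaker)

# A higher score suggests the two utterances sound more like the same speaker.
print(f"same speaker:  {score_same:.3f}")
print(f"other speaker: {score_other:.3f}")
```

Scores lie in [-1, 1], and any decision threshold would depend on the encoder and the data, so it is left out of this sketch.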
Papers
October 14, 2024
October 1, 2024
September 14, 2024
July 22, 2024
July 8, 2024
July 7, 2024
June 26, 2024
June 22, 2024
June 21, 2024
June 13, 2024
June 12, 2024
June 10, 2024
June 8, 2024
April 25, 2024
March 6, 2024
March 4, 2024
December 27, 2023
December 14, 2023