Face Voice
Face-voice association research focuses on establishing robust links between a person's facial features and their voice, aiming to improve multimodal biometric systems and applications like speaker identification and virtual human creation. Current research emphasizes developing sophisticated models, often employing contrastive learning, multimodal encoders, and techniques like fusion and orthogonal projection, to handle challenges such as multilingual speech and limited data. These advancements are significant for improving the accuracy and efficiency of audio-visual systems across diverse applications, including security, entertainment, and accessibility technologies.
Papers
August 4, 2024
July 29, 2024
April 15, 2024
April 14, 2024
September 25, 2023
September 18, 2023
July 26, 2023
March 10, 2023
February 27, 2023
December 1, 2022
September 24, 2022
August 22, 2022
April 28, 2022