Multi Speaker
Multi-speaker research focuses on developing robust systems capable of processing and understanding audio and video containing multiple simultaneous speakers. Current efforts concentrate on improving speech separation and recognition techniques, often employing deep neural networks like Conformers and Transformers, along with innovative training methods such as Serialized Output Training and speaker-aware CTC. These advancements are crucial for applications ranging from meeting transcription and voice assistants to improving accessibility for individuals with hearing impairments, driving significant progress in both speech processing and human-computer interaction.
Papers
September 24, 2024
September 19, 2024
September 16, 2024
September 13, 2024
September 1, 2024
August 25, 2024
July 27, 2024
July 22, 2024
July 13, 2024
July 4, 2024
July 1, 2024
June 20, 2024
April 27, 2024
April 10, 2024
February 14, 2024
February 2, 2024
November 20, 2023
October 31, 2023
October 18, 2023