Speaker Information
Extracting and using speaker information is central to speech processing: the goal is to identify and isolate individual speakers within an audio recording despite background noise or overlapping speech. Current research focuses on making models robust, often with transformer-based architectures and techniques such as prompt learning, particularly in challenging settings such as multi-speaker recordings and low-resource languages. These advances matter for applications such as meeting transcription, voice assistants, and personalized speech technologies, where they can improve accessibility and the user experience.