Speaker Information
Research on speaker information covers how to extract and use speaker-specific cues in speech processing, with the goal of identifying and isolating individual speakers in audio recordings despite background noise or overlapping speech. Current work focuses on building robust models for this task, often using transformer-based architectures and techniques such as prompt learning, particularly for challenging scenarios involving multiple speakers or low-resource languages. These advances matter for applications such as meeting transcription, voice assistants, and personalized speech technologies, where they improve accessibility and the user experience.