Speech Segment
Speech segment analysis focuses on extracting meaningful information from discrete portions of spoken audio, aiming to improve various speech-related applications. Current research emphasizes developing robust models, such as transformer networks and graph convolutional networks, to handle challenges like noise, speaker variability, and overlapping speech, often incorporating multimodal data (audio-visual) and self-supervised learning techniques for improved performance. These advancements are driving progress in diverse fields, including mental health assessment, speech-to-speech translation, and speaker diarization, by enabling more accurate and efficient processing of spoken language.
Papers
September 22, 2022
August 11, 2022
July 25, 2022
June 9, 2022
May 19, 2022
May 9, 2022
March 26, 2022
February 15, 2022
December 15, 2021