Audio Alignment
Audio alignment, the process of synchronizing audio and textual data, is crucial for various applications, including speech recognition, speaker diarization, and multimodal emotion recognition. Current research focuses on developing robust alignment algorithms, such as dynamic time warping and multiple sequence alignment, and leveraging these alignments for data augmentation techniques to improve model performance in tasks with limited training data. These advancements are significantly impacting fields like speech technology and music information retrieval by enabling more accurate and efficient processing of audio-textual information.
Papers
September 15, 2023
September 14, 2023
July 26, 2022
May 6, 2022
March 28, 2022
March 16, 2022