Audio Modality
Audio modality research focuses on understanding and utilizing audio data for various applications, primarily aiming to improve the accuracy and efficiency of tasks involving sound. Current research emphasizes multimodal approaches, integrating audio with visual or textual data using techniques like cross-attention mechanisms and contrastive learning within deep learning frameworks, often leading to improved performance over unimodal methods. This field is significant because it enables advancements in diverse areas, including speaker verification, sound source localization, and medical diagnosis through analysis of respiratory sounds, ultimately impacting fields like healthcare, assistive technologies, and multimedia processing.
Papers
October 19, 2024
June 10, 2024
May 15, 2024
November 28, 2023
September 28, 2023
August 9, 2023
August 2, 2023
June 21, 2023
May 24, 2023
March 12, 2023
February 27, 2023
June 11, 2022
May 25, 2022
March 2, 2022