Audio Text
Audio-text research focuses on bridging the gap between audio and textual representations, aiming to improve tasks like audio generation, retrieval, and captioning. Current efforts concentrate on developing large-scale, temporally-aligned datasets with rich annotations and employing transformer-based models, contrastive learning, and diffusion models to achieve better alignment and understanding of audio-text relationships. These advancements are significant for improving human-computer interaction, accessibility technologies, and multimedia applications by enabling more nuanced and accurate processing of audio information.
Papers
September 15, 2024
July 5, 2024
July 3, 2024
June 27, 2024
June 25, 2024
May 1, 2024
April 27, 2024
March 7, 2024
February 21, 2024
February 10, 2024
January 12, 2024
January 5, 2024
November 13, 2023
November 2, 2023
September 16, 2023
September 14, 2023
May 29, 2023
May 22, 2023