Audio Text Retrieval
Audio-text retrieval (ATR) focuses on developing systems that can efficiently retrieve audio clips based on textual descriptions, and vice versa. Current research emphasizes improving the accuracy and robustness of ATR by exploring advanced architectures like transformers and diffusion models, addressing challenges such as handling temporal information within audio, and mitigating the impact of noisy or misaligned training data through techniques like contrastive learning and adversarial training. ATR's advancements have significant implications for various applications, including multimedia search, content creation, and assistive technologies, by enabling more intuitive and effective interaction with audio-visual data.
Papers
October 21, 2024
September 16, 2024
September 14, 2024
September 1, 2024
July 25, 2024
June 11, 2024
May 31, 2024
May 16, 2024
May 1, 2024
March 16, 2024
March 15, 2024
November 2, 2023
September 16, 2023
August 29, 2023
July 28, 2023
March 19, 2023
March 14, 2023
November 12, 2022
November 8, 2022