Language Based Audio Retrieval
Language-based audio retrieval focuses on developing systems that can accurately retrieve audio segments matching a given text description. Current research emphasizes improving retrieval accuracy through advanced model architectures like dual encoders and transformer-based approaches, often incorporating contrastive learning and techniques to address data imbalances, such as paraphrasing and data augmentation. These advancements are driven by the need for more robust and efficient methods for indexing and searching large audio datasets, with applications ranging from improved multimedia search to enhanced accessibility for individuals with visual impairments.
Papers
October 29, 2024
October 10, 2024
September 24, 2024
August 21, 2024
May 1, 2024
January 16, 2024
January 10, 2024
September 20, 2023
August 8, 2023
October 6, 2022
September 20, 2022
August 24, 2022
July 8, 2022