Audio Embeddings
Audio embeddings are numerical representations of sound, aiming to capture both acoustic and semantic information for various applications like sound classification and retrieval. Current research focuses on developing robust and efficient embedding models, often leveraging deep neural networks such as transformers and convolutional neural networks, and exploring techniques like contrastive learning and knowledge distillation to improve performance and generalization across diverse audio datasets. This field is significant due to its potential to enhance numerous applications, including speech recognition, music information retrieval, and even mental health assessment, by enabling more accurate and efficient audio analysis.
Papers
September 1, 2023
August 22, 2023
July 21, 2023
June 30, 2023
June 23, 2023
June 21, 2023
June 9, 2023
April 30, 2023
April 22, 2023
March 3, 2023
January 6, 2023
November 20, 2022
October 7, 2022
October 6, 2022
September 30, 2022
May 16, 2022