Music Caption
Music captioning aims to automatically generate natural language descriptions of music, bridging the gap between audio and textual representations. Current research focuses on developing models that capture both global and fine-grained musical characteristics, often leveraging large language models and contrastive learning techniques to improve the alignment between audio features and textual descriptions, and addressing challenges like data scarcity through data augmentation and synthetic data generation. This field is significant for improving music information retrieval, enabling more effective music recommendation systems, and facilitating a deeper understanding of music's structure and emotional content.
Papers
October 13, 2024
October 3, 2024
September 16, 2024
September 15, 2024
July 29, 2024
September 15, 2023
August 22, 2023
July 31, 2023
January 14, 2023