Paper ID: 2407.21068

Exploring Genre and Success Classification through Song Lyrics using DistilBERT: A Fun NLP Venture

Servando Pizarro Martinez, Moritz Zimmermann, Miguel Serkan Offermann, Florian Reither

This paper presents a natural language processing (NLP) approach to the problem of thoroughly comprehending song lyrics, with particular attention on genre classification, view-based success prediction, and approximate release year. Our tests provide promising results with 65\% accuracy in genre classification and 79\% accuracy in success prediction, leveraging a DistilBERT model for genre classification and BERT embeddings for release year prediction. Support Vector Machines outperformed other models in predicting the release year, achieving the lowest root mean squared error (RMSE) of 14.18. Our study offers insights that have the potential to revolutionize our relationship with music by addressing the shortcomings of current approaches in properly understanding the emotional intricacies of song lyrics.

Submitted: Jul 28, 2024