Abstract

Now-a-days, the research is more towards the entertainment like music, songs, movies, etc. There are many existing works that suggest good songs, movies to people depending on their mood, nature and time that has been savior for the society during the days of lockdown. The existing algorithms used in the literature for basic clustering are K-means, TSNE (T- distributed Stochastic Neighborhood Embedding), PCA (Principal Component Analysis).In this paper, the music dataset considered, consists of songs with attributes as song name, genres, artists, mode, tempo, valence, year, liveness, loudness, popularity, acousticness, danceability, duration, energy, explicit, instrumentalness, key. The important feature is extracted from the other features with the support of literature survey i.e., number of music listeners, types of the songs and type of the music. Later, the dataset is divided into clusters using traditional technique that is k-means based on genre, an important attribute which is selected from the above attributes. The different classifier models like Random Forest, Extra Trees, LightGBM, XGBoost, CatBoost classifier are applied on the clustered dataset and the results have been evaluated on each individual algorithm. Thus the paper recommends not only the group of relevant songs but also suggests the best accurate classification algorithm that can be used for any mentioned musical dataset. The paper also compares all the said ensemble algorithms by calculating the precision, recall, f1-score and support. The accuracy is also calculated for all said ensemble algorithms and based on the accuracy the best suitable algorithm is suggested.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call