Abstract

A raga is a unique set of notes with certain rules that carefully followed, retain and protect its purity and produce amazing musical effects. An automated raga transcription and identification is important for computational musicology, which is an important step for musicology for indexing, classifying, and recommending tunes. In the present research, the audio features such as mel frequency cepstrum coefficients (MFCCs), spectral flux, short time energy, audio feature extractor, and spectral centroid features are used for the prediction of a raga. The model showed more complexity which means it required lots of training data. The proposed enhanced spatial bound whale optimization algorithm (ESBWOA) is used that overcome the feature selection problem of high dimensional features. In addition to this, a weighted salp swarm algorithm (SSA) is used for selecting the tone-based features from the ragas based on amplitude or each raga sample. The features were fed for bidirectional long short-term memory (Bi-LSTM) network, which enhanced the success rate for raga identification and classification. The present research uses CompMusic dataset in the research work where 9 classes for Carnatic music and 7 classes in Hindustani music are considered for the classification of ragas.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call