Abstract

Deep learning models, particularly CNNs and RNNs, are frequently used for Bengali music emotion classification. To extract meaningful knowledge, however, the low accuracy and overfitting reported in past studies have to be addressed. We propose a model that combines Conv1D, Bi-GRU and the Bahdanau attention mechanism for music emotion classification on our Bengali music dataset. The model integrates MFCC-based preprocessing of WAV audio with deep learning and attention-based methods, and the attention mechanism increases the accuracy of the proposed classifier. Each piece of music is finally assigned to one of four emotion classes: Angry, Happy, Relax, or Sad. Comparisons with baseline models show that the proposed Conv1D+BiGRU+Attention model is more effective and efficient at classifying emotions in the Bengali music dataset. On our Bengali music dataset, the proposed model achieves a performance of 95%.
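To make the role of the attention step concrete, the following is a minimal, dependency-free sketch of additive (Bahdanau-style) attention used as a pooling layer over a sequence of recurrent outputs, such as those a Bi-GRU would produce. It is an illustration under stated assumptions, not the authors' implementation: the function name `additive_attention_pool`, the parameters `W` and `v`, and all dimensions are hypothetical, and the query term of the original Bahdanau formulation is dropped, which is a common simplification when attention is used for classification rather than decoding.

```python
import math


def additive_attention_pool(hidden_states, W, v):
    """Query-free additive attention pooling (illustrative sketch).

    hidden_states: list of T vectors of length d (e.g. Bi-GRU outputs).
    W: projection matrix given as a list of a rows, each of length d.
    v: scoring vector of length a.
    Returns (context, weights): a length-d vector and T attention weights.
    """
    scores = []
    for h in hidden_states:
        # score_t = v^T tanh(W h_t)
        projected = [
            math.tanh(sum(w_ij * h_j for w_ij, h_j in zip(row, h)))
            for row in W
        ]
        scores.append(sum(v_i * p_i for v_i, p_i in zip(v, projected)))

    # Numerically stable softmax over the time axis.
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: attention-weighted sum of the hidden states.
    d = len(hidden_states[0])
    context = [
        sum(w * h[j] for w, h in zip(weights, hidden_states))
        for j in range(d)
    ]
    return context, weights
```

In a full model, the resulting context vector would be passed to a dense softmax layer over the four emotion classes; `W` and `v` would be learned jointly with the convolutional and recurrent layers rather than supplied by hand.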

