Abstract

Most Music Emotion Recognition (MER) approaches rely on supervised learning models trained on music features and their corresponding annotations. However, few researchers have considered applying unsupervised learning to labeled data beyond feature representation. In this paper, we propose a segment-based two-stage model that combines unsupervised and supervised learning. In the first stage, we split each music excerpt into contiguous segments and use an autoencoder to generate segment-level feature representations. In the second stage, we feed these time-series segments to a bidirectional long short-term memory (BiLSTM) model to perform the final music emotion classification. Compared with whole music excerpts, segments offer a more appropriate granularity for model training and enlarge the pool of training samples, reducing the risk of overfitting in deep learning. In addition, we apply frequency and time masking to the segment-level inputs in the unsupervised stage to improve training performance. We evaluate our model on two datasets. The results show that our model outperforms state-of-the-art models, some of which use multimodal architectures, and the performance comparison also demonstrates the effectiveness of audio segmentation and of the unsupervised autoencoder with masking.
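
As an illustrative sketch only (the choice of PyTorch, the toy layer sizes, and the simplified masking routine below are assumptions for exposition, not the implementation used in this work), the two-stage pipeline can be outlined as follows:

# Hypothetical sketch of the segment-based two-stage model described above.
# All names, layer sizes, and the use of PyTorch are illustrative assumptions.
import torch
import torch.nn as nn


class SegmentAutoencoder(nn.Module):
    """Stage 1: unsupervised segment-level feature representation."""
    def __init__(self, n_features=128, latent_dim=32):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(n_features, 64), nn.ReLU(),
            nn.Linear(64, latent_dim))
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 64), nn.ReLU(),
            nn.Linear(64, n_features))

    def forward(self, x):
        z = self.encoder(x)
        return self.decoder(z), z


class BiLSTMClassifier(nn.Module):
    """Stage 2: supervised emotion classification over segment sequences."""
    def __init__(self, latent_dim=32, hidden=64, n_classes=4):
        super().__init__()
        self.lstm = nn.LSTM(latent_dim, hidden, batch_first=True,
                            bidirectional=True)
        self.fc = nn.Linear(2 * hidden, n_classes)

    def forward(self, z_seq):
        out, _ = self.lstm(z_seq)          # (batch, segments, 2*hidden)
        return self.fc(out[:, -1, :])      # classify from the last step (simplified)


def mask_segments(x, max_width=8):
    """Zero out a random band of feature bins: a crude stand-in for the
    frequency/time masking applied to segment-level inputs."""
    x = x.clone()
    start = torch.randint(0, x.shape[-1] - max_width, (1,)).item()
    x[..., start:start + max_width] = 0.0
    return x


# Toy usage: a batch of 2 excerpts, each split into 10 segments of 128 features.
segments = torch.randn(2, 10, 128)
ae = SegmentAutoencoder()
recon, z = ae(mask_segments(segments))     # stage 1: train with MSE(recon, segments)
logits = BiLSTMClassifier()(z.detach())    # stage 2: train with cross-entropy loss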

