Abstract

Physiological signal-based emotion recognition has received considerable attention in affective computing. Owing to its reliability and user-friendly acquisition, electrodermal activity (EDA) is particularly attractive for practical applications. However, EDA-based emotion recognition across large numbers of subjects remains a difficult problem. Traditional well-designed classifiers built on hand-crafted features perform poorly because of their limited representation ability, while deep learning models with automatic feature extraction suffer from overfitting caused by large-scale individual differences. Since music is strongly correlated with human emotion, the static music stimulus can serve as an external benchmark that constrains the highly variable dynamic EDA signals. In this article, we therefore fuse each subject’s individual EDA features with the features of the externally evoked music, and propose an end-to-end multimodal framework, the one-dimensional residual temporal and channel attention network (RTCAN-1D). For the EDA branch, a channel-temporal attention mechanism is introduced into EDA-based emotion recognition for the first time to mine temporal and channel-wise dynamic and steady features. Comparisons with single-modality EDA-based state-of-the-art models on the DEAP and AMIGOS datasets demonstrate the effectiveness of RTCAN-1D in mining EDA features. For the music branch, we process the music signal with the open-source toolkit openSMILE to obtain external feature vectors. We conducted systematic and extensive evaluations, and the experiments on PMEmo, currently the largest music emotion dataset, validate that fusing EDA and music is a reliable and efficient solution for large-scale emotion recognition.
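
To make the described pipeline concrete, the sketch below shows one plausible way to combine a 1D residual EDA branch with channel-temporal attention and a precomputed music feature vector through late fusion. It is a minimal illustration under our own assumptions (layer sizes, kernel widths, and names such as EDAMusicFusionNet and music_dim are hypothetical), not the authors' RTCAN-1D implementation.

```python
# Illustrative sketch only: a 1D residual block with channel and temporal
# attention, fused with an external (openSMILE-style) music feature vector.
# All layer sizes and module names are assumptions, not the paper's spec.
import torch
import torch.nn as nn


class ChannelTemporalAttention(nn.Module):
    """Channel reweighting followed by temporal reweighting of a 1D feature map."""

    def __init__(self, channels: int):
        super().__init__()
        self.channel_fc = nn.Sequential(
            nn.Linear(channels, channels // 2),
            nn.ReLU(),
            nn.Linear(channels // 2, channels),
            nn.Sigmoid(),
        )
        self.temporal_conv = nn.Sequential(
            nn.Conv1d(channels, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, time)
        ch_weights = self.channel_fc(x.mean(dim=2)).unsqueeze(-1)  # (B, C, 1)
        x = x * ch_weights                                         # channel attention
        t_weights = self.temporal_conv(x)                          # (B, 1, T)
        return x * t_weights                                       # temporal attention


class ResidualAttentionBlock1D(nn.Module):
    """Residual 1D conv block whose output is modulated by the attention module."""

    def __init__(self, channels: int):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv1d(channels, channels, kernel_size=5, padding=2),
            nn.BatchNorm1d(channels),
            nn.ReLU(),
            nn.Conv1d(channels, channels, kernel_size=5, padding=2),
            nn.BatchNorm1d(channels),
        )
        self.attn = ChannelTemporalAttention(channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.relu(x + self.attn(self.body(x)))


class EDAMusicFusionNet(nn.Module):
    """EDA branch (conv stem + residual attention blocks) fused with a static music vector."""

    def __init__(self, music_dim: int = 64, channels: int = 32, num_classes: int = 2):
        super().__init__()
        self.stem = nn.Conv1d(1, channels, kernel_size=7, padding=3)
        self.blocks = nn.Sequential(*[ResidualAttentionBlock1D(channels) for _ in range(2)])
        self.pool = nn.AdaptiveAvgPool1d(1)
        self.classifier = nn.Sequential(
            nn.Linear(channels + music_dim, 64),
            nn.ReLU(),
            nn.Linear(64, num_classes),
        )

    def forward(self, eda: torch.Tensor, music_feat: torch.Tensor) -> torch.Tensor:
        # eda: (batch, 1, time) preprocessed EDA segment
        # music_feat: (batch, music_dim) feature vector of the evoking music clip
        h = self.pool(self.blocks(self.stem(eda))).squeeze(-1)     # (B, channels)
        fused = torch.cat([h, music_feat], dim=1)                  # late fusion by concatenation
        return self.classifier(fused)


if __name__ == "__main__":
    model = EDAMusicFusionNet()
    eda = torch.randn(4, 1, 1024)    # four EDA segments
    music = torch.randn(4, 64)       # four matching music feature vectors
    print(model(eda, music).shape)   # torch.Size([4, 2])
```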
