A multi-stage dynamical fusion network for multimodal emotion recognition.

Sihan Chen,Li Zhu,Wanzeng Kong,Jiajia Tang

doi:10.1007/s11571-022-09851-w

Abstract

In recent years, emotion recognition using physiological signals has become a popular research topic. Physiological signal can reflect the real emotional state for individual which is widely applied to emotion recognition. Multimodal signals provide more discriminative information compared with single modal which arose the interest of related researchers. However, current studies on multimodal emotion recognition normally adopt one-stage fusion method which results in the overlook of cross-modal interaction. To solve this problem, we proposed a multi-stage multimodal dynamical fusion network (MSMDFN). Through the MSMDFN, the joint representation based on cross-modal correlation is obtained. Initially, the latent and essential interactions among various features extracted independently from multiple modalities are explored based on specific manner. Subsequently, the multi-stage fusion network is designed to split the fusion procedure into multi-stages using the correlation observed before. This allows us to exploit much more fine-grained unimodal, bimodal and trimodal intercorrelations. For evaluation, the MSMDFN was verified on multimodal benchmark DEAP. The experiments indicate that our method outperforms the related one-stage multi-modal emotion recognition works.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A multi-stage dynamical fusion network for multimodal emotion recognition.

Abstract

Talk to us

Similar Papers

More From: Cognitive neurodynamics

Lead the way for us

Journal: Cognitive neurodynamics	Publication Date: Jul 31, 2022
Citations: 13

Similar Papers

Emotion Recognition using Multimodal Residual LSTM Network
Jiaxin Ma ... Wei-Long Zheng
-
Jiaxin Ma, et. al.Jiaxin Ma ... Wei-Long Zheng
15 Oct 2019
15 Oct 2019

Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition
Hengshun Zhou ... Qing Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Hengshun Zhou, et. al.Hengshun Zhou ... Qing Wang
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

A multimodal fusion emotion recognition method based on multitask learning and attention mechanism
Jinbao Xie ... Yury I Varatnitski
Neurocomputing | VOL. 556
Jinbao Xie, et. al.Jinbao Xie ... Yury I Varatnitski
04 Aug 2023
Neurocomputing | VOL. 556

Multi-modal fusion network with complementarity and importance for emotion recognition
Shuai Liu ... Weiping Ding
Information Sciences | VOL. 619
Shuai Liu, et. al.Shuai Liu ... Weiping Ding
18 Nov 2022
Information Sciences | VOL. 619

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A multi-stage dynamical fusion network for multimodal emotion recognition.

Abstract

Talk to us

Similar Papers

More From: Cognitive neurodynamics