Abstract

Multimodal multi-label emotion recognition (MMER) is a vital yet challenging task in affective computing. Despite significant progress in prior work, three limitations remain: (i) limited applicability in real-world scenarios due to the assumption that multimodal data are pre-aligned; (ii) inadequate utilization of long-term dependencies across modalities; and (iii) insufficient exploitation of correlations among emotion labels. To overcome these limitations, this paper proposes a Multi-modal Attention Graph model with Dynamic Routing-by-Agreement (MAGDRA). In MAGDRA, multimodal data are fused without pre-alignment via a pseudo-alignment algorithm (PAA). Furthermore, an Expectation-maximized Cross-modal Temporal (ECT) fusion approach is presented to effectively learn cross-modal interactions and long-term dependencies among visual, audio, and textual data. Moreover, to address the under-exploited correlations among multiple labels, a Reinforced Multi-Label Emotion Detection (RMLED) module is carefully designed. Extensive experiments on three public benchmark datasets (IEMOCAP, CMU-MOSI, and CMU-MOSEI) demonstrate that MAGDRA outperforms existing methods and has the potential to generalize to multimodal multi-label tasks in other domains.
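To make the problem setting concrete, the following is a minimal sketch of cross-modal attention fusion over unaligned sequences with a multi-label output head. It is an illustration of the task only: the class name, dimensions, and two-stream attention design are assumptions for this sketch and do not reproduce the authors' MAGDRA, PAA, ECT, or RMLED components.

```python
# Illustrative sketch: text queries attend over audio and visual sequences of
# arbitrary (unaligned) lengths, so no word-level pre-alignment is required.
# This is NOT the MAGDRA implementation; names and sizes are placeholders.
import torch
import torch.nn as nn


class CrossModalFusion(nn.Module):
    def __init__(self, d_model=128, n_heads=4, n_labels=6):
        super().__init__()
        self.attn_ta = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.attn_tv = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(d_model)
        # Multi-label head: one independent logit per emotion label.
        self.classifier = nn.Linear(3 * d_model, n_labels)

    def forward(self, text, audio, visual):
        # text: (B, Lt, d), audio: (B, La, d), visual: (B, Lv, d);
        # Lt, La, Lv may all differ because the streams are unaligned.
        ta, _ = self.attn_ta(text, audio, audio)    # text attends to audio
        tv, _ = self.attn_tv(text, visual, visual)  # text attends to visual
        fused = torch.cat([self.norm(text),
                           self.norm(ta),
                           self.norm(tv)], dim=-1).mean(dim=1)  # pool over time
        return self.classifier(fused)               # raw multi-label logits


if __name__ == "__main__":
    model = CrossModalFusion()
    t = torch.randn(2, 20, 128)   # 20 text tokens
    a = torch.randn(2, 300, 128)  # 300 audio frames, unaligned with text
    v = torch.randn(2, 150, 128)  # 150 visual frames
    logits = model(t, a, v)
    # Multi-label training would apply BCEWithLogitsLoss to these logits.
    print(logits.shape)           # torch.Size([2, 6])
```

In a multi-label setting, each emotion is scored independently (e.g. with a per-label sigmoid), which is precisely why modeling label correlations, as the RMLED module aims to do, matters.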
