Video multimodal emotion recognition based on Bi-GRU and attention fusion

Ruo-Hong Huan,Peng Chen,Rong-Hua Liang,Jia Shu,Sheng-Lin Bao,Kai-Kai Chi

doi:10.1007/s11042-020-10030-4

Abstract

A video multimodal emotion recognition method based on Bi-GRU and attention fusion is proposed in this paper. Bidirectional gated recurrent unit (Bi-GRU) is applied to improve the accuracy of emotion recognition in time contexts. A new network initialization method is proposed and applied to the network model, which can further improve the video emotion recognition accuracy of the time-contextual learning. To overcome the weight consistency of each modality in multimodal fusion, a video multimodal emotion recognition method based on attention fusion network is proposed. The attention fusion network can calculate the attention distribution of each modality at each moment in real-time so that the network model can learn multimodal contextual information in real-time. The experimental results show that the proposed method can improve the accuracy of emotion recognition in three single modalities of textual, visual, and audio, meanwhile improve the accuracy of video multimodal emotion recognition. The proposed method outperforms the existing state-of-the-art methods for multimodal emotion recognition in sentiment classification and sentiment regression.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Video multimodal emotion recognition based on Bi-GRU and attention fusion

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Oct 31, 2020
Citations: 35

Similar Papers

Multimodal fusion: A study on speech-text emotion recognition with the integration of deep learning
Yanan Shang ... Tianqi Fu
Intelligent Systems with Applications | VOL. 24
Yanan Shang, et. al.Yanan Shang ... Tianqi Fu
08 Sep 2024
Intelligent Systems with Applications | VOL. 24

E-MFNN: an emotion-multimodal fusion neural network framework for emotion recognition.
Zhuen Guo ... Chen Yang
PeerJ Computer Science | VOL. 10
Zhuen Guo, et. al.Zhuen Guo ... Chen Yang
19 Apr 2024
PeerJ Computer Science | VOL. 10

Research on Emotion Recognition Method of Flight Training Based on Multimodal Fusion
Wendong Wang ... Zhibin Zhang
International Journal of Human–Computer Interaction | VOL. ahead-of-print
Wendong Wang, et. al.Wendong Wang ... Zhibin Zhang
16 Sep 2023
International Journal of Human–Computer Interaction | VOL. ahead-of-print

Students' classroom Emotion Analysis Based on Intelligent Recognition
Bin Fan ... Xiaojing Zeng
-
Bin Fan, et. al.Bin Fan ... Xiaojing Zeng
01 Nov 2022
01 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Video multimodal emotion recognition based on Bi-GRU and attention fusion

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications