Towards enhancing emotion recognition via multimodal framework

C Akalya Devi, Shweta Yadav, D Karthika Renuka, Krishnaprasad Thirunarayan, D Harish, G Pooventhiran

doi:10.3233/jifs-220280

Abstract

Emotional AI is the next era of AI and is set to play a major role in fields such as entertainment, health care, and self-paced online education by considering cues from multiple sources. In this work, we propose a multimodal emotion recognition system that extracts information from speech, motion capture (mocap), and text data. The main aim of this research is to improve the unimodal architectures so that they outperform the state of the art, and to combine them into a robust multimodal fusion architecture. We developed 1D and 2D CNN-LSTM time-distributed models for speech, a hybrid CNN-LSTM model for motion capture data, and a BERT-based model for text data to achieve state-of-the-art results, and attempted both concatenation-based decision-level fusion and Deep CCA-based feature-level fusion schemes. The proposed speech and mocap models achieve emotion recognition accuracies of 65.08% and 67.51%, respectively, and the BERT-based text model achieves an accuracy of 72.60%. The decision-level fusion approach significantly improves the accuracy of detecting emotions on the IEMOCAP and MELD datasets. This approach achieves 80.20% accuracy on IEMOCAP, which is 8.61% higher than state-of-the-art methods, and 63.52% and 61.65% on 5-class and 7-class classification, respectively, on the MELD dataset, both of which are also higher than the state of the art.
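
To make the idea of concatenation-based decision-level fusion concrete, the following is a minimal sketch and not the authors' implementation. It assumes each unimodal model emits a class-probability vector, joins those vectors end to end, and passes the result through a small linear classifier. The four-label set, the example logits, and the `averaging_weights` helper (a stand-in for weights that would normally be learned) are all hypothetical.

```python
import math

# Hypothetical label set; the paper's actual classes may differ.
EMOTIONS = ["angry", "happy", "neutral", "sad"]


def softmax(logits):
    """Turn raw scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def concat_decisions(*modality_probs):
    """Decision-level fusion input: join per-modality probability vectors."""
    fused = []
    for probs in modality_probs:
        fused.extend(probs)
    return fused


def linear_classifier(features, weights, bias):
    """One dense layer followed by softmax over the fused vector."""
    logits = [
        sum(w * f for w, f in zip(row, features)) + b
        for row, b in zip(weights, bias)
    ]
    return softmax(logits)


def averaging_weights(n_classes, n_modalities):
    """Placeholder for learned weights (an assumption for illustration):
    row c sums the class-c probability reported by every modality."""
    width = n_classes * n_modalities
    return [
        [1.0 if j % n_classes == c else 0.0 for j in range(width)]
        for c in range(n_classes)
    ]


# Made-up unimodal outputs standing in for the speech, mocap, and text models.
speech = softmax([0.2, 1.0, 0.1, 0.3])
mocap = softmax([0.1, 0.4, 0.9, 0.2])
text = softmax([0.0, 2.0, 0.5, 0.1])

fused_vector = concat_decisions(speech, mocap, text)
weights = averaging_weights(len(EMOTIONS), 3)
fused_probs = linear_classifier(fused_vector, weights, [0.0] * len(EMOTIONS))
best = max(range(len(EMOTIONS)), key=fused_probs.__getitem__)
predicted = EMOTIONS[best]
```

In a trained system the weight matrix would be fitted on held-out unimodal predictions, so the fusion layer can learn how much to trust each modality per class.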

Journal: Journal of Intelligent & Fuzzy Systems
Publication Date: Jan 30, 2023
Citations: 2

Similar Papers

Multi-head attention fusion networks for multi-modal speech emotion recognition
Junfeng Zhang ... Kesheng Wang
Computers & Industrial Engineering | VOL. 168
10 Mar 2022

GraspDB14 – Documentation on a database of grasp motions and its creation
12 Jan 2018

In-depth investigation of speech emotion recognition studies from past to present –The importance of emotion recognition from speech signal for AI–
Yeşim Ülgen Sönmez ... Asaf Varol
Intelligent Systems with Applications | VOL. 22
11 Mar 2024

Radar modeling and validation of human gaits using simultaneous collections of motion capture and radar data
Ryan Hersey ... Lamar Westbrook
01 Nov 2013
