Abstract

In fields that require an understanding of emotions, such as digital human interaction and public opinion analysis, building a dependable and interpretable model for mining correlations among multimodal features remains a primary objective. However, current deep learning methods often lack transparency and offer limited interpretability. To address these challenges, we propose a novel Correlation Mining method based on Higher-Order Partial Least Squares (HOPLS) for multimodal Emotion Recognition in conversations (CMHER). CMHER combines HOPLS with Transformers and Gated Recurrent Units (GRUs) to compute correlation matrices both within unimodal data streams and across modalities. HOPLS projects source data into a latent space and predicts target data through correlation matrix computations, eliminating the need for Graphics Processing Unit (GPU) acceleration and making the method suitable for experimental and edge systems. To integrate HOPLS with deep neural networks, multimodal features are first preprocessed into suitable dimensions and latent representations; HOPLS then computes correlation matrices between the cross-modal latent vectors and the final labels through optimal joint subspace approximation, improving both interpretability and reliability. In addition, a generalization error fitting module further refines the predicted correlation matrices to improve predictive capability and overall model performance. Experiments on two public datasets validate the superiority of the proposed method.
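To make the latent-space projection idea concrete, the following is a minimal sketch (not the authors' implementation) of predicting emotion labels from fused multimodal features via paired latent projections. It uses ordinary two-way PLS as a CPU-only stand-in for the higher-order (tensor) HOPLS variant; the feature dimensions, class count, and data here are illustrative assumptions.

```python
# Minimal sketch, assuming fused multimodal features have already been produced
# by Transformer/GRU encoders. Ordinary PLS regression stands in for HOPLS:
# both learn paired projections of inputs and labels whose loadings act as the
# correlation matrices described in the abstract. Runs entirely on CPU.
import numpy as np
from sklearn.cross_decomposition import PLSRegression

rng = np.random.default_rng(0)

# Hypothetical setup: N utterances, D-dimensional fused latent features,
# n_classes emotion categories (values chosen only for illustration).
N, D, n_classes = 200, 64, 6
X = rng.standard_normal((N, D))          # fused multimodal latent features
labels = rng.integers(0, n_classes, N)   # ground-truth emotion indices
Y = np.eye(n_classes)[labels]            # one-hot emotion labels

# Fit paired projections of X and Y that maximize covariance between their
# latent scores; the resulting loading matrices link latent vectors to labels.
pls = PLSRegression(n_components=8)
pls.fit(X, Y)

# Predict label scores for utterances and take the argmax as the emotion class.
pred = pls.predict(X).argmax(axis=1)
print("train accuracy (toy data):", (pred == labels).mean())
```

A full HOPLS treatment would retain the tensor structure of the per-modality feature streams instead of flattening them, but the prediction pipeline follows the same project-then-regress pattern shown above.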
