Abstract

In recent years, research has found that depression manifests primarily in patients' language and facial expressions. Moreover, facial expressions and speech intonation naturally co-occur, making facial and vocal information core indicators for depression recognition. It is therefore important to explore how deep learning methods can be used effectively for multimodal depression detection. We propose a novel model consisting of a three-stream Multimodal Encoding Network (MEN), an Attentional Decision Fusion (ADF) module, and a feature-extraction fusion strategy, and we adopt a hybrid fusion approach that combines early intra-modality fusion with late inter-modality fusion for multimodal depression diagnosis. In the feature-extraction fusion component, different representations of the same modality are combined before being fed into the network for training, strengthening the depression-relevant features in the data. The multimodal encoding network extracts frame-level information with Convolutional Neural Networks (CNNs) while capturing long-term context and dependencies with Bidirectional Long Short-Term Memory (BiLSTM). Finally, the three information streams are integrated through an attention-based fused representation in the ADF module for depression-score regression. Extensive experiments were conducted on two public datasets, AVEC2013 and AVEC2014, on which the mean absolute error / root mean squared error (MAE/RMSE) scores for depression-score prediction were 6.48/8.91 and 7.01/9.38, respectively. These results demonstrate that the proposed hybrid fusion method outperforms traditional early- or late-fusion approaches.
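As a rough illustration of the pipeline described above, the sketch below encodes each stream with a per-frame CNN followed by a BiLSTM and combines the stream representations through attention-weighted decision fusion for depression-score regression. It is a minimal sketch in PyTorch under assumed settings: all layer sizes, feature dimensions, class names, and the composition of the three streams are illustrative assumptions, not the authors' exact configuration.

```
# Hypothetical sketch: CNN frame encoder + BiLSTM context model per stream,
# then attention-based decision fusion for depression-score regression.
# Dimensions and stream composition are assumptions for illustration only.
import torch
import torch.nn as nn


class ModalityEncoder(nn.Module):
    """Encodes one stream: per-frame CNN features -> BiLSTM context -> vector."""

    def __init__(self, in_dim: int, cnn_dim: int = 128, lstm_dim: int = 128):
        super().__init__()
        # 1-D convolution over the time axis extracts frame-level patterns.
        self.cnn = nn.Sequential(
            nn.Conv1d(in_dim, cnn_dim, kernel_size=3, padding=1),
            nn.ReLU(),
        )
        # BiLSTM captures long-term context and dependencies across frames.
        self.bilstm = nn.LSTM(cnn_dim, lstm_dim, batch_first=True, bidirectional=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, in_dim)
        h = self.cnn(x.transpose(1, 2)).transpose(1, 2)    # (batch, time, cnn_dim)
        out, _ = self.bilstm(h)                            # (batch, time, 2*lstm_dim)
        return out.mean(dim=1)                             # temporal pooling -> (batch, 2*lstm_dim)


class AttentionDecisionFusion(nn.Module):
    """Weights each stream's representation with learned attention, then regresses a score."""

    def __init__(self, feat_dim: int):
        super().__init__()
        self.attn = nn.Linear(feat_dim, 1)
        self.regressor = nn.Linear(feat_dim, 1)

    def forward(self, streams: list[torch.Tensor]) -> torch.Tensor:
        stacked = torch.stack(streams, dim=1)               # (batch, n_streams, feat_dim)
        weights = torch.softmax(self.attn(stacked), dim=1)  # attention over streams
        fused = (weights * stacked).sum(dim=1)              # attention-weighted fusion
        return self.regressor(fused).squeeze(-1)            # predicted depression score


if __name__ == "__main__":
    audio = torch.randn(4, 100, 40)   # e.g. 40-d acoustic features per frame (assumed)
    video = torch.randn(4, 100, 136)  # e.g. 136-d facial landmark features per frame (assumed)
    enc_a = ModalityEncoder(40)
    enc_v = ModalityEncoder(136)
    fusion = AttentionDecisionFusion(feat_dim=256)
    # Early intra-modality fusion would concatenate multiple representations of the
    # same modality before encoding; here each stream is encoded separately and the
    # resulting vectors are combined by late, attention-based decision fusion.
    a, v = enc_a(audio), enc_v(video)
    score = fusion([a, v, (a + v) / 2])  # third stream: a simple joint representation (assumed)
    print(score.shape)  # torch.Size([4])
```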
