Abstract

Depression is a common mental illness that affects the physical and mental health of hundreds of millions of people around the world, so designing an efficient and robust depression detection model is an urgent research task. To fully extract depression-related features, we systematically analyze audio-visual and text data related to depression and propose a multimodal fusion model with a multi-level attention mechanism (MFM-Att) for depression detection. The method consists of two stages. In the first stage, two LSTMs and a Bi-LSTM with an attention mechanism learn multi-view audio features, visual features, and rich text features, respectively. In the second stage, the output features of the three modalities are fed into an attention fusion network (AttFN) to obtain effective depression information, exploiting the diversity and complementarity between modalities. Notably, the multi-level attention mechanism not only extracts valuable intra-modality depressive features but also learns inter-modality correlations, improving the overall performance of the model by reducing the influence of redundant information. The MFM-Att model is evaluated on the DAIC-WOZ dataset and outperforms state-of-the-art models in terms of root mean square error (RMSE).
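
The sketch below illustrates the two-stage design described in the abstract as a minimal PyTorch module: per-modality encoders with attention in stage one and an attention-based fusion over the modality vectors in stage two. The feature dimensions, the additive-attention pooling, and the final linear regressor are assumptions for illustration; the paper's exact layer sizes and the internals of AttFN are not specified here.

```python
# Minimal sketch of the two-stage MFM-Att design, assuming hypothetical
# feature dimensions and a simple additive attention standing in for AttFN.
import torch
import torch.nn as nn


class AttentivePool(nn.Module):
    """Additive attention over a sequence, returning a weighted summary vector."""
    def __init__(self, dim):
        super().__init__()
        self.score = nn.Linear(dim, 1)

    def forward(self, seq):                               # seq: (batch, steps, dim)
        weights = torch.softmax(self.score(seq), dim=1)   # attention weights per step
        return (weights * seq).sum(dim=1)                 # (batch, dim)


class MFMAttSketch(nn.Module):
    """Stage 1: per-modality encoders; Stage 2: attention fusion across modalities."""
    def __init__(self, audio_dim=74, visual_dim=136, text_dim=300, hidden=128):
        super().__init__()
        # Stage 1: two LSTMs for the audio and visual views, a Bi-LSTM for text.
        self.audio_lstm = nn.LSTM(audio_dim, hidden, batch_first=True)
        self.visual_lstm = nn.LSTM(visual_dim, hidden, batch_first=True)
        self.text_bilstm = nn.LSTM(text_dim, hidden, batch_first=True, bidirectional=True)
        self.text_proj = nn.Linear(2 * hidden, hidden)    # map Bi-LSTM output to a common size
        self.text_att = AttentivePool(hidden)             # intra-modality attention over text steps
        # Stage 2: attention over the three modality vectors (stand-in for AttFN).
        self.fuse_att = AttentivePool(hidden)
        self.regressor = nn.Linear(hidden, 1)             # predicts the depression severity score

    def forward(self, audio, visual, text):
        a = self.audio_lstm(audio)[0][:, -1]              # last hidden state of the audio LSTM
        v = self.visual_lstm(visual)[0][:, -1]            # last hidden state of the visual LSTM
        t = self.text_att(self.text_proj(self.text_bilstm(text)[0]))
        stacked = torch.stack([a, v, t], dim=1)           # (batch, 3, hidden)
        fused = self.fuse_att(stacked)                    # weight modalities by relevance
        return self.regressor(fused).squeeze(-1)          # scalar prediction per sample
```

Training such a model against the DAIC-WOZ severity labels with an MSE loss would correspond to the RMSE evaluation reported above.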
