Abstract

Targeting the application of deep learning classification to disease diagnosis, this paper proposes a disease classification model based on multi-modal feature fusion. In this model, patients' chest X-ray images serve as the image modality and the corresponding disease descriptions serve as the text modality. An adaptive multi-modal attention mechanism is proposed to fuse the feature vectors extracted from the two modalities, and the fused representation is passed to a classifier. To verify the effectiveness of the proposed model, experiments are conducted on the chest X-ray dataset from the OpenI database. Because this dataset is small and its classes are imbalanced, the SMOTE algorithm is used to oversample minority classes, and an ablation study is designed to compare model variants. The results show that the model combining image and text modalities with SMOTE-based sample expansion alleviates the overfitting and the low recall and F1 scores caused by the small, imbalanced dataset. In addition, the classification accuracy of the multi-modal model is improved by about 0.55% and 2.69% over the single-modal models using only images or only text, respectively. Likewise, the adaptive multi-modal attention mechanism improves classification accuracy by about 0.41% compared with simple vector concatenation for feature fusion.
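The abstract does not detail the fusion architecture, so the following is a minimal sketch of one plausible form of adaptive multi-modal attention: both modalities are projected into a shared space, a learned score per modality is normalized with a softmax, and the fused vector is the weighted sum of the two projections. All dimensions, layer choices, and names here are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class AdaptiveMultimodalAttention(nn.Module):
    """Hypothetical sketch: learn input-dependent weights for the image and
    text feature vectors and fuse them by a weighted sum instead of plain
    concatenation."""

    def __init__(self, img_dim: int, txt_dim: int, fused_dim: int = 512):
        super().__init__()
        # Project both modalities into a shared space.
        self.img_proj = nn.Linear(img_dim, fused_dim)
        self.txt_proj = nn.Linear(txt_dim, fused_dim)
        # One score per modality; softmax over the two scores yields
        # adaptive fusion weights.
        self.score = nn.Linear(fused_dim, 1)

    def forward(self, img_feat: torch.Tensor, txt_feat: torch.Tensor) -> torch.Tensor:
        img_h = torch.tanh(self.img_proj(img_feat))   # (B, fused_dim)
        txt_h = torch.tanh(self.txt_proj(txt_feat))   # (B, fused_dim)
        scores = torch.cat([self.score(img_h), self.score(txt_h)], dim=1)  # (B, 2)
        weights = torch.softmax(scores, dim=1)                             # (B, 2)
        return weights[:, :1] * img_h + weights[:, 1:] * txt_h             # (B, fused_dim)

# Example: fuse a 2048-d CNN image embedding with a 768-d text embedding,
# then classify with a linear head (class count is a placeholder).
fusion = AdaptiveMultimodalAttention(img_dim=2048, txt_dim=768)
classifier = nn.Linear(512, 14)
img_feat = torch.randn(8, 2048)
txt_feat = torch.randn(8, 768)
logits = classifier(fusion(img_feat, txt_feat))
```

Compared with simple concatenation, this kind of weighting lets the model down-weight whichever modality is less informative for a given sample. For the SMOTE-based sample expansion mentioned above, a standard implementation is available in imbalanced-learn; `X_features` and `y_labels` below are placeholder names for the extracted feature matrix and class labels.

```python
from imblearn.over_sampling import SMOTE

# Oversample minority disease classes by synthesizing new feature vectors.
X_resampled, y_resampled = SMOTE(random_state=42).fit_resample(X_features, y_labels)
```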
