Abstract
The existence of fundus diseases not only endangers people's vision but also imposes a serious economic burden on society. Fundus images are an objective and standard basis for the diagnosis of fundus diseases. With the continuous advancement of computer science, deep learning methods dominated by convolutional neural networks (CNNs) have been widely used in fundus image classification. However, current CNN-based fundus image classification research still has considerable room for improvement: a CNN cannot effectively avoid interference from repetitive background information and has a limited ability to model global context. In response to these findings, this paper proposes the CNN-Trans model, a parallel dual-branch network consisting of a CNN-LSTM branch and a Vision Transformer (ViT) branch. The CNN-LSTM branch uses Xception, after transfer learning, as the base feature extractor; the LSTM, placed before the classification head, addresses the vanishing-gradient problem in neural network iterations; and a new lightweight attention mechanism, Coordinate Attention, is introduced between Xception and the LSTM to emphasize the key information relevant to classification and suppress less useful repetitive background information. The self-attention mechanism in the ViT branch, not being limited to local interactions, can establish long-distance dependencies on the target and extract global features. Finally, a concatenation (Concat) operation fuses the features of the two branches: the local features extracted by the CNN-LSTM branch and the global features extracted by the ViT branch are complementary, and after fusion the more comprehensive image feature information is sent to the classification layer.
Finally, extensive experiments and comparisons show that the CNN-Trans model achieves an accuracy of 80.68% on the fundus image classification task, with classification performance comparable to state-of-the-art methods.
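The dual-branch design described above can be sketched in PyTorch as follows. This is a minimal illustration under stated assumptions: the small CNN stack, the simplified Coordinate Attention gate, the patch-embedding transformer layer, and all layer sizes are hypothetical placeholders standing in for the paper's Xception backbone and ViT branch, whose actual configurations are not given here.

```python
import torch
import torch.nn as nn


class CoordinateAttention(nn.Module):
    """Simplified coordinate-attention gate: pools along height and width
    separately so the attention weights retain positional information."""

    def __init__(self, channels, reduction=8):
        super().__init__()
        mid = max(channels // reduction, 8)
        self.conv1 = nn.Conv2d(channels, mid, kernel_size=1)
        self.act = nn.ReLU()
        self.conv_h = nn.Conv2d(mid, channels, kernel_size=1)
        self.conv_w = nn.Conv2d(mid, channels, kernel_size=1)

    def forward(self, x):
        b, c, h, w = x.shape
        x_h = x.mean(dim=3, keepdim=True)                      # (b, c, h, 1)
        x_w = x.mean(dim=2, keepdim=True).permute(0, 1, 3, 2)  # (b, c, w, 1)
        y = self.act(self.conv1(torch.cat([x_h, x_w], dim=2)))
        y_h, y_w = torch.split(y, [h, w], dim=2)
        a_h = torch.sigmoid(self.conv_h(y_h))                          # (b, c, h, 1)
        a_w = torch.sigmoid(self.conv_w(y_w.permute(0, 1, 3, 2)))      # (b, c, 1, w)
        return x * a_h * a_w  # re-weight features along both directions


class CNNTrans(nn.Module):
    """Parallel CNN-LSTM / transformer branches fused by concatenation."""

    def __init__(self, num_classes=8, cnn_ch=32, vit_dim=64):
        super().__init__()
        # Placeholder for the Xception backbone (a real backbone is far larger).
        self.cnn = nn.Sequential(
            nn.Conv2d(3, cnn_ch, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(cnn_ch, cnn_ch, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.ca = CoordinateAttention(cnn_ch)
        self.lstm = nn.LSTM(cnn_ch, cnn_ch, batch_first=True)
        # Placeholder ViT branch: patch embedding + one self-attention layer.
        self.patch = nn.Conv2d(3, vit_dim, kernel_size=16, stride=16)
        self.encoder = nn.TransformerEncoderLayer(
            d_model=vit_dim, nhead=4, batch_first=True)
        self.head = nn.Linear(cnn_ch + vit_dim, num_classes)

    def forward(self, x):
        # CNN-LSTM branch: backbone features -> coordinate attention -> LSTM.
        f = self.ca(self.cnn(x))                      # (b, c, h, w)
        seq = f.flatten(2).transpose(1, 2)            # (b, h*w, c)
        _, (h_n, _) = self.lstm(seq)
        local_feat = h_n[-1]                          # (b, c) local features
        # ViT branch: self-attention over patches captures global context.
        p = self.patch(x).flatten(2).transpose(1, 2)  # (b, n_patches, d)
        global_feat = self.encoder(p).mean(dim=1)     # (b, d) global features
        # Concat fusion of local and global features, then classify.
        return self.head(torch.cat([local_feat, global_feat], dim=1))


logits = CNNTrans()(torch.randn(2, 3, 64, 64))
print(logits.shape)  # torch.Size([2, 8])
```

The fusion step is the key design choice: rather than averaging or gating, the two feature vectors are simply concatenated so the classification layer can weigh local and global evidence independently.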