Classification of Diabetic Retinopathy Based on Efficient Computational Modeling

Jiao Xue,Jianyu Wu,Yingxu Bian,Shiyan Zhang,Qinsheng Du

doi:10.3390/app142311327

Jiao Xue, Jianyu Wu + Show 3 more

Open Access

https://doi.org/10.3390/app142311327

Copy DOI

Export

Save

Cite

Journal: Applied Sciences	Publication Date: Dec 4, 2024
License type: CC BY 4.0

Abstract
Full-Text
Similar Papers

Abstract

Listen

Convolutional neural networks (CNN) and Vision Transformers (ViT) have long been the main backbone networks for visual classification in the field of deep learning. Although ViT has recently received more attention than CNN due to its excellent fitting ability, their scalability is largely limited by the quadratic complexity of attention computation. For the determination of diabetic retinopathy, the fundus lesions as well as the width, angle, and branching pattern of retinal blood vessels are characterized, inspired by the ability of Mamba and VMamba to efficiently model long sequences, VMamba-m is proposed in this paper. This is a generalized visual skeleton model designed to reduce computational complexity to linear while retaining the advantageous features of ViTs. By modifying the cross-entropy loss function, we enhance the model’s attention to rare categories, especially in large-scale multi-category classification tasks. In order to enhance the adaptability of the VMamba-m model in processing visual data, we introduce the se channel attention mechanism, which enables the model to learn features in the channel dimension and form the importance of each channel. Finally, different weights are assigned to each channel through the incentive part. In addition to this, this paper further improves the implementation details and architectural design by introducing a novel attention mechanism implemented based on the local windowing method, which aims to optimize the model’s ability in processing long sequence data to enhance the performance of VMamba-m and improve its inference speed. Extensive experimental results show that VMamba-m performs well in the retinopathy V classification task, and it has significant advantages in terms of accuracy and computation time over existing benchmark models.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Classification of Diabetic Retinopathy Based on Efficient Computational Modeling

Abstract

Published Version

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Spatiotemporal Co-Attention Hybrid Neural Network for Pedestrian Localization Based on 6D IMU
Yingying Wang ... Max Q.-H. Meng
IEEE Transactions on Automation Science and Engineering | VOL. 20
Yingying Wang, et. al.Yingying Wang ... Max Q.-H. Meng
01 Jan 2023
IEEE Transactions on Automation Science and Engineering | VOL. 20

Conv-RGNN: An efficient Convolutional Residual Graph Neural Network for ECG classification
Yupeng Qiang ... Jianhong Dou
Computer Methods and Programs in Biomedicine | VOL. 257
Yupeng Qiang, et. al.Yupeng Qiang ... Jianhong Dou
03 Sep 2024
Computer Methods and Programs in Biomedicine | VOL. 257

Comparison of Attention Mechanism in Convolutional Neural Networks for Binary Classification of Breast Cancer Histopathological Images
Marcin Ziąber ... Karol Przystalski
-
Marcin Ziąber, et. al.Marcin Ziąber ... Karol Przystalski
01 Jan 2023
01 Jan 2023

Super-resolution reconstruction of binocular image based on multi-level fusion attention network
Lei Xu ... Huihui Song
Journal of Image and Graphics | VOL. 28
Lei Xu, et. al.Lei Xu ... Huihui Song
01 Jan 2023
Journal of Image and Graphics | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Classification of Diabetic Retinopathy Based on Efficient Computational Modeling

Abstract

Published Version

Talk to us

Similar Papers

More From: Applied Sciences