Abstract
Deep learning methods have been successfully applied in medical image classification, segmentation and detection tasks. The U-Net architecture has been widely applied for these tasks. In this paper, we propose a U-Net variant for improved vessel segmentation in retinal fundus images. Firstly, we design a minimal U-Net (Mi-UNet) architecture, which drastically reduces the parameter count to 0.07M compared to 31.03M for the conventional U-Net. Moreover, based on Mi-UNet, we propose Salient U-Net (S-UNet), a bridge-style U-Net architecture with a saliency mechanism and with only 0.21M parameters. S-UNet uses a cascading technique that employs the foreground features of one net block as the foreground attention information of the next net block. This cascading leads to enhanced input images, inheritance of the learning experience of previous net blocks, and hence effective solution of the data imbalance problem. S-UNet was tested on two benchmark datasets, DRIVE and CHASE_DB1, with image sizes of 584 × 565 and 960 × 999, respectively. S-UNet was tested on the TONGREN clinical dataset with image sizes of 1880 × 2816. The experimental results show superior performance in comparison to other state-of-theart methods. Especially, for whole-image input from the DRIVE dataset, S-UNet achieved a Matthews correlation coefficient (MCC), an area under curve (AUC), and an F1 score of 0.8055, 0.9821, and 0.8303, respectively. The corresponding scores for the CHASE_DB1 dataset were 0.8065, 0.9867, and 0.8242, respectively. Moreover, our model shows an excellent performance on the TONGREN clinical dataset. In addition, S-UNet segments images of low, medium, and high resolutions in just 33ms, 91ms and 0.49s, respectively. This shows the real-time applicability of the proposed model.
Highlights
Diagnosis is crucial for many diseases that lead to human vision deterioration, such as glaucoma, hypertension and diabetic retinopathy [1], [2]
The main contribution of this paper is that we propose Salient U-Net (S-UNet), a bridge-style deep learning architecture that uses a cascading approach to apply the foreground features of one minimal U-Net (Mi-UNet) block as the foreground salient information of the Mi-UNet block to enhance the input images and inherit the learning experiences of the previous Mi-UNet blocks
While U-Net operates on image patches, Mi-UNet takes whole images as input
Summary
Diagnosis is crucial for many diseases that lead to human vision deterioration, such as glaucoma, hypertension and diabetic retinopathy [1], [2]. Ophthalmologists typically examine retinal fundus images to assess the clinical condition of the retinal blood vessels, which is an important indicator. For the diagnosis of various ophthalmic diseases. Manual labeling of retinal vessels in these images is timeconsuming, tedious and requires high clinical experience. Real-time automatic segmentation of retinal blood vessels is highly needed [3], and has attracted great attention in recent decades [4]. Existing retinal vessel segmentation methods can be divided into unsupervised and supervised methods [5].
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.