Retinal segmentation is a crucial step in the early warning of human health conditions. However, retinal blood vessels possess complex curvature, irregular distribution, and contain multi-scale fine structures, which make the limited receptive field of regular convolution challenging to process their vascular details efficiently. Additionally, the encoder-decoder based network leads to irreversible spatial information loss because of multiple downsampling, resulting in over-segmentation and missed segmentation of the vessels. For this reason, we develop a high-resolution network based on Deformable Convolution v3, called HRD-Net. By constructing a high-resolution representation, the network allows special attention to be paid to the details of tiny blood vessels. The proposed feature enhancement cascade module based on Deformable Convolution v3 can flexibly adapt and capture the ever-changing morphology and intricate connections of retinal blood vessels, ensuring the continuity of vessel segmentation. In the output phase of the network, the proposed global aggregation module integrates full-resolution feature maps while suppressing redundant features, achieving an effective fusion of high-level semantic information and spatial detail information. In addition, we have re-examined the selection criteria for activation and normalization methods, and also refine the network architectures from a spatial domain perspective to release redundant computational loads. Testing on the DRIVE, STARE, and CHASE_DB1 datasets indicates that HRD-Net, with fewer parameters, outperforms existing segmentation methods on several evaluation metrics such as F1, ACC, SE, SP, AUC, and IOU.
Read full abstract