ABSTRACTThis paper introduces a novel edge detection operator called ‘vision transformer-based Dexinet (ViT-DexiNet)’ to address the challenges of detecting small objects in synthetic aperture radar (SAR) images. SAR images are typically impacted by strong multiplicative noise, making edge detection difficult. Existing traditional methods have limited spectral data preservation capabilities and often result in a loss of clarity and integrity of salient features in SAR images. The proposed ViT-DexiNet operator employs a series of interconnected layers to extract and refine salient edge features from SAR images. It utilizes a vision transformer self-attention layer to capture the pattern and structural details of image features crucial for determining edges. The extracted feature maps are then processed by the DexiNet architecture, which consists of a dense block, transfer block, and upsampling network. This architecture helps preserve edge information at different scales in deeper layers. The series of layered blocks generate edge maps, which are concatenated and averaged through smoothing to remove noise and enhance edge details in SAR images, resulting in a final high-quality edge map. To evaluate the proposed ViT-DexiNet method, both qualitative and quantitative analyses are conducted using standard edge detection operators such as Canny and Sobel. The empirical results demonstrate that the ViT-DexiNet surpasses baseline edge detection operators. The achieved values of proposed edge detection operator are 97.92%, 97.72%, 97.64% and 97.41%, respectively for the metrics accuracy, precision, recall and f1-score. The ViT-DexiNet offers high-quality edge maps, simplifying the interpretation of data for small object detection. Overall, the ViT-DexiNet method shows promise in overcoming the limitations of traditional approaches and improving the detection of edges in SAR images.
Read full abstract