ABSTRACT The use of deep learning-based high resolution remote sensing image sea and land segmentation method has become prevalent in various fields such as environmental monitoring, resource assessment, and urban planning. However, the segmentation process faces challenges due to the complex coastline and the similarity in spectral colour difference between the sea and land in certain areas. The remote sensing images obtained from satellites often exhibit unclear boundaries between the sea and land, as well as intricate coastlines. Consequently, the segmentation of sea and land becomes a difficult task. The existing sea-land segmentation methods have limitations in accurately segmenting long and narrow waters, as well as areas with low differentiation between sea and land. In this paper, we propose a new sea-land segmentation method called SRMA (Swin-Res-Multi-Attention). SRMA performs parallel feature extraction using a dual-branch parallel: Swin-Transformer branch and ResNet branch. The combination of Swin-Transformer’s self-attention and ResNet’s residual design helps reduce the loss of feature information, thereby avoiding incorrect segmentation in regions with low distinctness. To improve overall stability, we introduce the MCA (Multiscale channel attention module) module, which combines feature information from different scales. The multi-scale pyramid design of MCA module ensures stability and improves segmentation accuracy in narrow waters. Experimental results on a public dataset demonstrate that our model outperforms existing methods. Additionally, we evaluate the model’s performance in complex scenarios by creating a new dataset that includes a complex urban water body and cloud interference. The results showcase the robustness and superiority of our model.
Read full abstract