Abstract
The building extraction from synthetic aperture radar (SAR) images has always been a challenging research topic. Recently, the deep convolution neural network brings excellent improvements in SAR segmentation. The fully convolutional network and other variants are widely transferred to the SAR studies because of their high precision in optical images. They are still limited by their processing in terms of the geometric distortion of buildings, the variability of building structures, and scattering interference between adjacent targets in the SAR images. In this article, a unified framework called selective spatial pyramid dilated (SSPD) network is proposed for the fine building segmentation in SAR images. First, we propose a novel encoder–decoder structure for the fine building feature reconstruction. The enhanced encoder and the dual-stage decoder, composed of the CBM and the SSPD module, extract and recover the crucial multiscale information better. Second, we design the multilayer SSPD module based on the selective spatial attention. The multiscale building information with different attention on multiple branches is combined, optimized, and adaptively selected for adaptive filtering and extracting features of complex multiscale building targets in SAR images. Third, according to the building features and SAR imaging mechanism, a new loss function called L-shape weighting loss (LWloss) is proposed to heighten the attention on the L-shape footprint characteristics of the buildings and reduce the missing detection of line buildings. Besides, LWloss can also alleviate the class imbalance problem in the optimization stage. Finally, the experiments on a large-scene SAR image dataset demonstrate the effectiveness of the proposed method and verify its superiority over other approaches, such as the region-based Markov random field, U-net, and DeepLabv3+.
Highlights
T HE building is a significant topographic object class in the city and a momentous data layer in the geographic information system
We demonstrate the effectiveness of our model in the fine building segmentation on a Gaofen-3 satellite synthetic aperture radar (SAR) dataset, and achieve the 91.2% accuracy performance on the test set without any postprocessing
The prediction results with L-shape weighting loss (LWloss), by contrast, are more sensitive to L-shape features, and the missing detection of some small L-shape targets decreases
Summary
T HE building is a significant topographic object class in the city and a momentous data layer in the geographic information system. Manuscript received September 20, 2020; revised January 26, 2021; accepted April 24, 2021. Automatic extraction of buildings from aerial remote sensing images is frequently used for surveying and mapping of ground objects, detection of illegal buildings, urban ecological planning, and regional development. Recent developments in building segmentation have heightened the need for fine extraction. Some works [8], [9] are limited by low-resolution images, resulting in the chaotic extraction effects. To obtain the precise boundary, positions, and scales of the buildings, it is of great need to fulfill the fine segmentation of buildings based on the high-resolution SAR images
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.