Abstract
Extracting buildings from remote sensing images plays an important role in urban development planning, disaster assessment and mapping. Convolutional neural network (CNN) has been widely applied to building extraction because of its powerful deep semantic feature extraction ability. However, existing CNN-based building extraction methods are difficult to accurately extract multiscale buildings with accurate edges because of the limitation of feature receptive fields and the loss of spatial detail information. For the above problems, this paper proposes a multiscale receptive field network (MSRF-Net) to accurately extract multiscale buildings from remote sensing images. MSRF-Net includes multiscale receptive field feature encoder (MRFF-Encoder) and multipath decoder. In the MRFF-Encoder, a multiscale attentional down (MSAD) module and asymmetric residual inception (ARI) module are proposed to capture multiscale receptive field features. In the multipath decoder, convolutions with different kernel size and dilation are used in three parallel paths to learn localization-preserved multiscale features with multiscale receptive field. What’s more, the features of different branches and MRFF-Encoder are fused by the proposed feature combination module, which contribute to capture context information of multiscale receptive field while recovering the resolution of feature space. The experimental results show that compared with the latest MAP-Net, MSRF-Net has achieved F1 score growth of 1.14%, 0.42%, 1.11% and IoU score growth of 1.68%, 0.76% and 1.64% respectively on Massachusetts data set, WHU data set and the Typical Cities Building data set.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: IEEE Transactions on Geoscience and Remote Sensing
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.