Abstract

Automatic building extraction based on high-resolution aerial images has important applications in urban planning and environmental management. In recent years advances and performance improvements have been achieved in building extraction through the use of deep learning methods. However, the design of existing models focuses attention to improve accuracy through an overflowing number of parameters and complex structure design, resulting in large computational costs during the learning phase and low inference speed. To address these issues, we propose a new, efficient end-to-end model, called ARC-Net. The model includes residual blocks with asymmetric convolution (RBAC) to reduce the computational cost and to shrink the model size. In addition, dilated convolutions and multi-scale pyramid pooling modules are utilized to enlarge the receptive field and to enhance accuracy. We verify the performance and efficiency of the proposed ARC-Net on the INRIA Aerial Image Labeling dataset and WHU building dataset. Compared to available deep learning models, the proposed ARC-Net demonstrates better segmentation performance with less computational costs. This indicates that the proposed ARC-Net is both effective and efficient in automatic building extraction from high-resolution aerial images.

Highlights

  • Automatic extraction of buildings based on aerial images is of great importance in a broad range of application fields including urban planning, change detection, map services, and disaster management [1,2,3,4,5]

  • EXPERIMENTAL RESULTS ON THE INRIA DATASET We first conduct the comparisons on the INRIA dataset between the ARC-Net model and the well-known models including SegNet, U-Net, ENet, and ERFNet

  • The proposed ARC-Net shows significantly less false positives and false negatives than the other models and is able to maintain a high degree of completeness in building segmentation on the INRIA dataset

Read more

Summary

Introduction

Automatic extraction of buildings based on aerial images is of great importance in a broad range of application fields including urban planning, change detection, map services, and disaster management [1,2,3,4,5]. The efficiency and accuracy of automatic building extraction are still difficult archive and remain a challenging objective which attracts huge research interests [6]. Many mathematical descriptors have been introduced to extract the spatial and textural features of an image, such as Histogram of Oriented Gradients [7], Haar spaces [8], Grey Level Co-occurrence Matrix [9], and Local Binary Patterns [10]. Several machine learning classifiers have been employed for a pixel-by-pixel analysis, including Random Forests [11], VOLUME XX, 2017

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.