Abstract

Abstract. Building detection and footprint extraction are highly demanded for many remote sensing applications. Though most previous works have shown promising results, the automatic extraction of building footprints still remains a nontrivial topic, especially in complex urban areas. Recently developed extensions of the CNN framework made it possible to perform dense pixel-wise classification of input images. Based on these abilities we propose a methodology, which automatically generates a full resolution binary building mask out of a Digital Surface Model (DSM) using a Fully Convolution Network (FCN) architecture. The advantage of using the depth information is that it provides geometrical silhouettes and allows a better separation of buildings from background as well as through its invariance to illumination and color variations. The proposed framework has mainly two steps. Firstly, the FCN is trained on a large set of patches consisting of normalized DSM (nDSM) as inputs and available ground truth building mask as target outputs. Secondly, the generated predictions from FCN are viewed as unary terms for a Fully connected Conditional Random Fields (FCRF), which enables us to create a final binary building mask. A series of experiments demonstrate that our methodology is able to extract accurate building footprints which are close to the buildings original shapes to a high degree. The quantitative and qualitative analysis show the significant improvements of the results in contrast to the multy-layer fully connected network from our previous work.

Highlights

  • 1.1 Related workBuilding detection and footprint extraction are important remote sensing tasks and used in the fields of urban planning and reconstruction, infrastructure development, three-dimensional (3D) building model generation, etc

  • As a continuation of our previous work in this paper we present a methodology using a deep learning approach, for building footprint extraction from remote sensing data, normalized DSM (nDSM), with a focus on dense residential areas

  • The results show that doing a binary classification of remote sensing data by using a deep convolutional network, in our case the Fully Convolution Network (FCN)-8, outperforms the binary mask generated by four-layer fully connected network

Read more

Summary

Introduction

Building detection and footprint extraction are important remote sensing tasks and used in the fields of urban planning and reconstruction, infrastructure development, three-dimensional (3D) building model generation, etc. Due to the sophisticated nature of urban environments the collection of building footprints from remotely sensed data is not yet productive and time consuming, if it is manually performed. Automatic methods are needed in order to complete the efficient collection of building footprints from large urban areas comprising of numerous constructions. Various approaches have been developed, which perform building extraction on the basis of high-resolution satellite imagery. Depending on the type of data employed for building extraction the existing methods can be divided into two main groups: using aerial or high-resolution satellite imagery and using three-dimensional (3D) information. Aerial photos and high-resolution satellite images are extensively used in urban studies. Due to the complexity of shapes and variety of materials of human-made constructions, the image classification in urban areas is still complicated

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.