Abstract

Building extraction from remotely sensed imagery plays an important role in urban planning, disaster management, navigation, updating geographic databases, and several other geospatial applications. Several published contributions dedicated to the applications of deep convolutional neural networks (DCNN) for building extraction using aerial/satellite imagery exists. However, in all these contributions, high accuracy is always obtained at the price of extremely complex and large network architectures. In this paper, we present an enhanced fully convolutional network (FCN) framework that is designed for building extraction of remotely sensed images by applying conditional random fields (CRFs). The main objective is to propose a methodology selecting a framework that balances high accuracy with low network complexity. A modern activation function, namely, the exponential linear unit (ELU), is applied to improve the performance of the fully convolutional network (FCN), thereby resulting in more accurate building prediction. To further reduce the noise (falsely classified buildings) and to sharpen the boundaries of the buildings, a post-processing conditional random fields (CRFs) is added at the end of the adopted convolutional neural network (CNN) framework. The experiments were conducted on Massachusetts building aerial imagery. The results show that our proposed framework outperformed the fully convolutional network (FCN), which is the existing baseline framework for semantic segmentation, in terms of performance measures such as the F1-score and IoU measure. Additionally, the proposed method outperformed a pre-existing classifier for building extraction using the same dataset in terms of the performance measures and network complexity.

Highlights

  • Recent years have witnessed technological advancement [1,2] in the remote-sensing community along with great administrative [3] and jurisdictional changes [4] that encourage the production and use of satellite imagery

  • Our first strategy aims at increasing the accuracy of the network by using exponential linear unit (ELU) as an activation function rather than the traditional rectified linear unit (ReLU)

  • In terms of Intersection over Union (IoU) value, the combined method (89.08%) outperforms fully convolutional network (FCN) (86.96%), with an increase of 2.1%. These increases in F1-measure and IoU value are attributed to increases in both precision and recall values compared to the original FCN network

Read more

Summary

Introduction

Recent years have witnessed technological advancement [1,2] in the remote-sensing community along with great administrative [3] and jurisdictional changes [4] that encourage the production and use of satellite imagery This collective development suggests that affordable access to massive amounts of high-resolution aerial/satellite imagery with long revisit times will become plausible over the coming decades. Extracting building images from satellite imagery will certainly benefit urban planning, disaster management, navigation, updating the geographic database, and several other geospatial applications [5,6] To enable such quantification and analysis using geographic information systems, a raw image should be transformed into tangible information [7]. This limits the availability of up-to-date and reliable building maps and the information that is contained in new image data to those who need it most

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call