Fully Convolutional Networks and Geographic Object-Based Image Analysis for the Classification of VHR Imagery

Nicholus Mboga,Moritz Lennert,Sabine Vanhuysse,Stefanos Georganos,Tais Grippa,Eléonore Wolff

doi:10.3390/rs11050597

Nicholus Mboga, Moritz Lennert + Show 4 more

Open Access

https://doi.org/10.3390/rs11050597

Copy DOI

Journal: Remote sensing	Publication Date: Mar 12, 2019
Citations: 48	License type: CC BY 4.0

Affiliation: Université Libre de Bruxelles

Abstract

Land cover Classified maps obtained from deep learning methods such as Convolutional neural networks (CNNs) and fully convolutional networks (FCNs) usually have high classification accuracy but with the detailed structures of objects lost or smoothed. In this work, we develop a methodology based on fully convolutional networks (FCN) that is trained in an end-to-end fashion using aerial RGB images only as input. Skip connections are introduced into the FCN architecture to recover high spatial details from the lower convolutional layers. The experiments are conducted on the city of Goma in the Democratic Republic of Congo. We compare the results to a state-of-the art approach based on a semi-automatic Geographic object image-based analysis (GEOBIA) processing chain. State-of-the art classification accuracies are obtained by both methods whereby FCN and the best baseline method have an overall accuracy of 91.3% and 89.5% respectively. The maps have good visual quality and the use of an FCN skip architecture minimizes the rounded edges that is characteristic of FCN maps. Additional experiments are done to refine FCN classified maps using segments obtained from GEOBIA generated at different scale and minimum segment size. High OA of up to 91.5% is achieved accompanied with an improved edge delineation in the FCN maps, and future work will involve explicitly incorporating boundary information from the GEOBIA segmentation into the FCN pipeline in an end-to-end fashion. Finally, we observe that FCN has a lower computational cost than the standard patch-based CNN approach especially at inference.

Highlights

Advances in remote sensing technology have increased the availability of a large volume of remote sensing data with a high spatial resolution
State-of-the-art deep learning baseline algorithms are used namely a fully convolutional network based on encoder-decoder network similar to SegNet [15,46] (FCN_dec) and a standard patch-based Convolutional neural networks (CNNs) (PB-CNN) that has a VGG-net type of architecture [47]
We have investigated the utility of deep fully convolutional networks for the classification of VHR aerial images of an urban environment in Goma, The Democratic Republic of Congo

Summary

Introduction

Advances in remote sensing technology have increased the availability of a large volume of remote sensing data with a high spatial resolution. One of the reasons for this is the use of downsampling layers which aim to increase the field of view of the CNN over the input data, but at the same time result in loss of high spatial details and localization of object boundaries. This can be quite limiting especially for some land cover classes such as buildings that mostly have sharply defined edges. Our FCN is implemented using the opensource Python libraries namely Keras and Theano [33,34]

GEOBIA Semi-Automatic Processing Chain

Overview of Abbreviations

Computation of Accuracy Metrics and Other Area Metrics

Findings

Discussion

Conclusions