SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS

D Marmanis,K Schindler,U Stilla,J D Wegner,M Datcu,S Galliani

doi:10.5194/isprsannals-iii-3-473-2016

Abstract

This paper describes a deep learning approach to semantic segmentation of very high resolution (aerial) images. Deep neural architectures hold the promise of end-to-end learning from raw images, making heuristic feature design obsolete. Over the last decade this idea has seen a revival, and in recent years deep convolutional neural networks (CNNs) have emerged as the method of choice for a range of image interpretation tasks like visual recognition and object detection. Still, standard CNNs do not lend themselves to per-pixel semantic segmentation, mainly because one of their fundamental principles is to gradually aggregate information over larger and larger image regions, making it hard to disentangle contributions from different pixels. Very recently two extensions of the CNN framework have made it possible to trace the semantic information back to a precise pixel position: deconvolutional network layers undo the spatial downsampling, and Fully Convolution Networks (FCNs) modify the fully connected classification layers of the network in such a way that the location of individual activations remains explicit. We design a FCN which takes as input intensity and range data and, with the help of aggressive deconvolution and recycling of early network layers, converts them into a pixelwise classification at full resolution. We discuss design choices and intricacies of such a network, and demonstrate that an ensemble of several networks achieves excellent results on challenging data such as the &lt;i&gt;ISPRS semantic labeling benchmark&lt;/i&gt;, using only the raw data as input.

Highlights

Large amounts of very high resolution (VHR) remote sensing images are acquired daily with either airborne or spaceborne platforms, mainly as base data for mapping and earth observation
This paper describes a deep learning approach to semantic segmentation of very high resolution images
Standard convolutional neural networks (CNNs) do not lend themselves to per-pixel semantic segmentation, mainly because one of their fundamental principles is to gradually aggregate information over larger and larger image regions, making it hard to disentangle contributions from different pixels

Summary

Introduction

Large amounts of very high resolution (VHR) remote sensing images are acquired daily with either airborne or spaceborne platforms, mainly as base data for mapping and earth observation. What makes automation challenging for VHR images is that on the one hand their spectral resolution is inherently lower, on the other hand small objects and small-scale surface texture become visible. Together, this leads to high within-class variability of the image intensities, and at the same time low inter-class differences. Semantic segmentation in urban areas poses the additional challenge that many man-made object categories are composed of a large number of different materials, and that objects in cities (such as buildings or trees) are small and interact with each other through occlusions, cast shadows, inter-reflections, etc

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences	Publication Date: Jun 6, 2016
Citations: 136	License type: cc-by

R Discovery Prime

R Discovery Prime

SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences

Lead the way for us

Similar Papers

SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS
D Marmanis ... J D Wegner
ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences | VOL. III-3
D Marmanis, et. al.D Marmanis ... J D Wegner
06 Jun 2016
ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences | VOL. III-3

Self-supervised multi-task learning for semantic segmentation of urban scenes
Jonathan Gonzalez-Santiago ... Jon Atli Benediktsson
-
Jonathan Gonzalez-Santiago, et. al.Jonathan Gonzalez-Santiago ... Jon Atli Benediktsson
12 Sep 2021
12 Sep 2021

Large-Scale Image Segmentation with Convolutional Networks
...
-
, et. al. ...
01 Jan 2017
01 Jan 2017

Semantic Segmentation using Modified U-Net for Autonomous Driving
T Sugirtha ... M Sridevi
-
T Sugirtha, et. al.T Sugirtha ... M Sridevi
01 Jun 2022
01 Jun 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences