Abstract

Abstract. Land use (LU) is an important information source commonly stored in geospatial databases. Most current work on automatic LU classification for updating topographic databases considers only one category level (e.g. residential or agricultural) consisting of a small number of classes. However, LU databases frequently contain very detailed information, using a hierarchical object catalogue where the number of categories differs depending on the hierarchy level. This paper presents a method for the classification of LU on the basis of aerial images that differentiates a fine-grained class structure, exploiting the hierarchical relationship between categories at different levels of the class catalogue. Starting from a convolutional neural network (CNN) for classifying the categories of all levels, we propose a strategy to simultaneously learn the semantic dependencies between different category levels explicitly. The input to the CNN consists of aerial images and derived data as well as land cover information derived from semantic segmentation. Its output is the class scores at three different semantic levels, based on which predictions that are consistent with the class hierarchy are made. We evaluate our method using two test sites and show how the classification accuracy depends on the semantic category level. While at the coarsest level, an overall accuracy in the order of 90% can be achieved, at the finest level, this accuracy is reduced to around 65%. Our experiments also show which classes are particularly hard to differentiate.

Highlights

  • Land use (LU) describes the socio-economic function of a piece of land

  • Digital orthophotos (DOP), a Digital Terrain Model (DTM), a Digital Surface Model (DSM) derived by image matching and land use objects from the German Authoritative Real Estate Cadastre Information System (ALKIS) are available

  • The evaluation is based on the number of correctly classified database objects and we report the average overall accuracy (OA) and F1 scores over both test runs of cross validation

Read more

Summary

Introduction

Land use (LU) describes the socio-economic function of a piece of land. This information is usually collected in geospatial databases, often acquired and maintained by national mapping agencies. The objects stored in these databases are typically represented by polygons with categories indicating the object’s LU To keep such databases up-to-date, the content can be compared with new remote sensing data. A class label related to its LU has to be determined from the remote sensing data for every object in the database This is achieved in a procedure consisting of two steps: first, the imagery is used to predict the land cover for each pixel; the land cover results and the images are combined in a second classification process to determine the LU for every database object (Gerke et al, 2008; Helmholtz et al, 2012). In this context, supervised classification methods are frequently applied, most recently based on Convolutional Neural Networks (CNN) (Zhang et al, 2018; Yang et al, 2019), which have been shown to outperform other classifiers such as Conditional Random Fields (CRF) (Albert et al, 2017)

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call