Abstract

Understanding and exploiting topographical data via standard machine learning techniques is challenging, mainly due to the large dynamic range of values present in elevation data and the lack of direct relationships between anthropogenic phenomena and topography, when considering topographic–geology couplings, for instance. Here we consider the first hurdle, dynamic range, in an effort to apply Convolutional Neural Network (CNN) approaches for prediction of human activity. CNN for learning 3-D elevation data relies on data normalization approaches, which only consider locally available points, thereby discarding contextual information and eliminating global contrast cues. We present a fully invertible and data-driven global partitioning elevation normalization (GPEN) preprocessing technique, which is intended to ameliorate the impact of limited data dynamic range. Global elevation populations are derived and used to formulate a distribution, which is used to adopt a partitioning scheme to remap all values according to global occurrence frequency, while preserving partition contrast. Using USGS 3-D Elevation Project and Microsoft building footprint data, we conduct a binary classification experiment predicting building footprint presence from elevation data, with and without a global remapping using the SegNet convolutional encoder-decoder model. The results of the experiment show more rapid model convergence, reduced regionalization errors, and enhanced classification metrics when compared to standard normalization preprocessing techniques. GPEN demonstrates performance over 10% higher than the next best conventional preprocessing method, with a mean overall accuracy of 94.76%. GPEN may show promise as an alternative normalization for deep learning with topological data.

Highlights

  • M ACHINE learning aims to learn and extract patterns from a dataset and its observations, and subsequently make decisions based on these data with minimal human oversight [1]

  • Based on the reserved testing class, the results for each processing method were evaluated on the basis of mean overall accuracy and mean intersection-over-union for the target building footprint class and background

  • The min–max approach exhibited the lowest performance with a mean overall accuracy of 61.73% and mean intersection-over-union (mIoU) of 0.234

Read more

Summary

Introduction

M ACHINE learning aims to learn and extract patterns from a dataset and its observations, and subsequently make decisions based on these data with minimal human oversight [1]. Manuscript received February 25, 2020; revised May 18, 2020; accepted June 10, 2020. Date of publication June 15, 2020; date of current version July 2, 2020. This is especially true within the fields of remote sensing and computer vision, where a wealth of imagery from airborne and spaceborne systems has fueled many campaigns intended to retrieve both scene physical parameters and semantic information. Tasks, which may have previously required a human analyst to exhaustively explore the data, have been automated. This capability to extract contextual information, in addition to direct measurement, has proven invaluable to the scientific community [2]

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.