Abstract

High resolution soil maps are urgently needed by land managers and researchers for a variety of applications. Digital Soil Mapping (DSM) allows to regionalize soil properties by relating them to environmental covariates with the help of an empirical model. In this study, a legacy soil data set was used to train a machine learning algorithm in order to predict the particle size distribution within the catchment of the Bode river in Saxony-Anhalt (Germany). The ensemble learning method random forest was used to predict soil texture based on environmental covariates originating from a digital elevation model, land cover data and geologic maps. We studied the usefulness of clustering applications in addressing various aspects of the DSM procedure. To investigate the role of the imbalanced data problem in the learning process, the environmental variables were used to cluster the landscape of the study area. Different sampling strategies were used to create balanced training data and were evaluated on their ability to improve model performance. Clustering applications were also involved in feature selection and stratified cross-validation. Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure. Beyond their successful application, further application fields in DSM were identified. One of them is to find adequate means to include expert knowledge.

Highlights

  • In order to sustain soil resources, land managers and researchers are in need of information on the continuous landscape-scale 15 distribution of soil properties

  • We studied the usefulness of clustering applications in addressing various aspects of the Digital Soil Mapping (DSM) procedure

  • Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure

Read more

Summary

Introduction

In order to sustain soil resources, land managers and researchers are in need of information on the continuous landscape-scale 15 distribution of soil properties. The categories of 20 these units do not necessarily represent soil systematic units and do not allow for the representation of small-scale, continuous variability. Overall, these soil maps were never meant to be used as input to landscape-scale process models that strive to simulate gas, matter and water flows. These soil maps were never meant to be used as input to landscape-scale process models that strive to simulate gas, matter and water flows From this demand and an advance in information technology, the domain of Digital Soil Mapping (DSM) has quickly advanced (Grunwald et al, 2011)

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call