Abstract

Understanding the limitations of soil data is essential for both managing environmental systems effectively and encouraging the responsible use of soil data. Explicit assessment of the uncertainties associated with soil data, and their storage in a soil database are therefore important. In practice, users will not want to separate ‘uncertain data’ from ‘certain data’ and will therefore require a single database that meets all the requirements of a conventional database, as well as the ability to handle uncertain data specifically. This chapter presents a framework that facilitates the storage of information about soil-data quality, including the uncertainties associated with soil data, in a conventional database design. It comprises a methodology for classifying data according to their attribute scale, which influences the structure of an uncertainty model, and their space–time variability, which determines the need for autocorrelation functions in describing uncertainty. In terms of the former, the key distinctions are among real numbers on a continuous domain, real or integer numbers on a discrete domain, categorical data and narrative data. In terms of the latter, the key distinctions are among data that are constant in space and time (e.g. universal constants); data that vary in time, but not in space; data that vary in space but not in time; and data that vary both in time and space. Thus, we distinguish 13 ‘data types’ to which individual datasets may be assigned. This simplifies the process of assessing uncertainties about soil data because characteristic uncertainty models can be defined for each data type. In general terms, an uncertain soil variable is completely specified by its probability distribution function (pdf). However, the complexity of a pdf varies with the 13 ‘data types’ identified. For example, the (cumulative) pdf of an uncertain numerical constant is simply a nondecreasing function on the real line. The database stores this function or some parameters of it, such as the mean and variance. Other data types are associated with more complex pdfs. For example, an uncertain categorical soil map requires the probability of each soil type occurring at any location to be defined (local uncertainty), as well as the spatial dependencies between these probabilities at multiple locations (spatial uncertainty). In practice, the complexity of the joint pdf will make it difficult or impossible to identify. Assumptions (such as a stationarity assumption) are therefore required to reduce the number of model parameters.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.