The Sensitivity of Mapping Methods to Reference Data Quality: Training Supervised Image Classifications with Imperfect Reference Data

Giles Foody,Duccio Rocchini,Mahesh Pal,Carol Garzon-Lopez,Lucy Bastin

doi:10.3390/ijgi5110199

Abstract

The accuracy of a map is dependent on the reference dataset used in its construction. Classification analyses used in thematic mapping can, for example, be sensitive to a range of sampling and data quality concerns. With particular focus on the latter, the effects of reference data quality on land cover classifications from airborne thematic mapper data are explored. Variations in sampling intensity and effort are highlighted in a dataset that is widely used in mapping and modelling studies; these may need accounting for in analyses. The quality of the labelling in the reference dataset was also a key variable influencing mapping accuracy. Accuracy varied with the amount and nature of mislabelled training cases with the nature of the effects varying between classifiers. The largest impacts on accuracy occurred when mislabelling involved confusion between similar classes. Accuracy was also typically negatively related to the magnitude of mislabelled cases and the support vector machine (SVM), which has been claimed to be relatively insensitive to training data error, was the most sensitive of the set of classifiers investigated, with overall classification accuracy declining by 8% (significant at 95% level of confidence) with the use of a training set containing 20% mislabelled cases.

Highlights

Maps are widely used in scientific research
This paper aims to explore the sensitivity of mapping methods to error and uncertainty in the reference datasets used in map derivation
The classifications based upon the original, assumed to be error-free, training set showed that the classification from support vector machine (SVM) (89.11%) was slightly more accurate than all of the other classifications; the accuracy of the classification from the discriminant analysis, relevance vector machine (RVM), and sparse multinomial logistic regression (SMLR) were 86.88%, 88.0%, and 88.67%, respectively

Summary

Introduction

Maps are widely used in scientific research. Their accuracy can, be critical, with the effect of map error being dramatic in a range of applications (e.g., [1]). The estimated value of ecosystem services for the conterminous USA determined using the National Land Cover Database (2006) changes from $1118 billion/y to $600 billion/y after adjustment for the known error in the maps used [2]. The databases may contain errors of varying nature and magnitude such as mislabelling arising from confusion between classes [3], which may vary regionally if, for example, the skills and expertise of data collectors vary These various sources of error (e.g., mislabelled cases) and uncertainty (e.g., ambiguous class membership) may degrade mapping and the effect may vary between mapping methods. It focuses on thematic mapping such as species distribution maps and land cover

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ISPRS International Journal of Geo-Information	Publication Date: Nov 1, 2016
Citations: 110	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The Sensitivity of Mapping Methods to Reference Data Quality: Training Supervised Image Classifications with Imperfect Reference Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information

Lead the way for us

Similar Papers

Impacts of imperfect reference data on the apparent accuracy of species presence-absence models and their predictions
Giles M Foody
Global Ecology and Biogeography | VOL. 20
Giles M FoodyGiles M Foody
29 Oct 2010
Global Ecology and Biogeography | VOL. 20

A Cluster Graph Approach to Land Cover Classification Boosting
Lloyd Haydn Hughes ... Johan Du Preez
Data | VOL. 4
Lloyd Haydn Hughes, et. al.Lloyd Haydn Hughes ... Johan Du Preez
10 Jan 2019
Data | VOL. 4

Evaluating capabilities of machine learning algorithms for aquatic vegetation classification in temperate wetlands using multi-temporal Sentinel-2 data
Erika Piaser ... Paolo Villa
International Journal of Applied Earth Observation and Geoinformation | VOL. 117
Erika Piaser, et. al.Erika Piaser ... Paolo Villa
24 Jan 2023
International Journal of Applied Earth Observation and Geoinformation | VOL. 117

Discrepancies in pediatric bone mineral density reference data: Potential for misdiagnosis of osteopenia
Mary B Leonard ... Harold I Feldman
The Journal of Pediatrics | VOL. 135
Mary B Leonard, et. al.Mary B Leonard ... Harold I Feldman
01 Aug 1999
The Journal of Pediatrics | VOL. 135

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Sensitivity of Mapping Methods to Reference Data Quality: Training Supervised Image Classifications with Imperfect Reference Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information