Abstract

In numerous applications of land-use/land-cover (LULC) classification, the classification rules are determined using a set of training data; thus, the results are inherently affected by uncertainty in the selection of those data. Few studies have assessed the accuracy of LULC classification with this consideration. In this article, we provide a general expression of various measures of classification accuracy with regard to the sample data set for classifier training and the sample data set for the evaluation of the classification results. We conducted stochastic simulations for LULC classification of a two-feature two-class case and a three-feature four-class case to show the uncertainties in the training sample and reference sample confusion matrices. A bootstrap simulation approach for establishing the 95% confidence interval of the classifier global accuracy was proposed and validated through rigorous stochastic simulation. Moreover, theoretical relationships among the producer accuracy, user accuracy, and overall accuracy were derived. The results demonstrate that the sample size of class-specific training data and the a priori probabilities of individual LULC classes must be jointly considered to ensure the correct determination of LULC classification accuracy.

Highlights

  • Land-use/land-cover (LULC) classification using remote sensing images has been applied in numerous studies, including investigations involving environmental monitoring and change detection (Cheng et al, 2008; Chen et al, 2017), research on urbanization effects (Herold et al, 2002; Teng et al, 2008; Hung et al, 2010), and disaster mitigation (Zope et al, 2015; Yang et al, 2018)

  • We provide a general expression of various measures of classification accuracy with regard to the sample data set for classifier training and the sample data set for the evaluation of the classification results

  • The results demonstrate that the sample size of class-specific training data and the a priori probabilities of individual land use/land cover (LULC) classes must be jointly considered to ensure the correct determination of LULC classification accuracy

Read more

Summary

INTRODUCTION

Land-use/land-cover (LULC) classification using remote sensing images has been applied in numerous studies, including investigations involving environmental monitoring and change detection (Cheng et al, 2008; Chen et al, 2017), research on urbanization effects (Herold et al, 2002; Teng et al, 2008; Hung et al, 2010), and disaster mitigation (Zope et al, 2015; Yang et al, 2018). Class-specific PA and UA values (or, correspondingly, omission and commission errors) summarized in a confusion matrix can be regarded as sample accuracy, and these values are only estimates of the true and yet unknown global accuracy (or population accuracy) concerning the entire study area (Hay, 1988; Stehman and Czaplewski, 1998) These accuracies or errors are inherently associated with uncertainty because of the uncertainty in the selection of the training and reference samples (Weber and Langille, 2007). We performed a stochastic simulation of multi-class multivariate Gaussian distributions and conducted LULC classification using simulated samples to examine the proposed approach and identify essential concepts

METHODOLOGY
M pil j1
RESULTS AND DISCUSSION
Evaluation of Reference Sample Classification Accuracy
Evaluation of Training Sample Classification Accuracy
Evaluation of Bootstrap Sample Classification Accuracy
C2 C3 SUM
SUMMARY AND CONCLUSION
DATA AVAILABILITY STATEMENT
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call