Incorporating Training Data Uncertainty in Machine Learning Models for Satellite Imagery

Hamed Alemohammad

doi:10.5194/egusphere-egu23-10528

Abstract

Supervised machine learning (ML) models rely on labels in the training data to learn the patterns of interest. In Earth science applications, these labels are usually collected by humans either as labels annotated on imagery (such as land cover class) or as in situ measurements (such as soil moisture). Both annotations and in situ measurements contain uncertainties resulting from factors such as class misinterpretation and device error. These training data uncertainties propagate through the ML model training and result in uncertainties in the model outputs. Therefore, it is essential to quantify these uncertainties and incorporate them in the model [1].

In this research, we will present results of inputting semantic segmentation label uncertainties into the model training and show that it improves model performance. The experiment is run using the LandCoverNet training dataset which contains global land cover labels based on time-series of Sentinel-2 multispectral imagery [2]. These labels are human annotations derived using a consensus algorithm based on the input labels from three independent annotators. The training dataset contains the consensus label and consensus score, and we treat the latter as a measure of uncertainty for each labeled pixel in the data. Our model architecture is a Convolutional Neural Network (CNN) trained on a subset of LandCoverNet with the rest of the dataset used for validation. We compare the results of this experiment with the same model trained on the dataset without the uncertainty information and show the improvement in the accuracy of the model.

[1] Elmes, A., Alemohammad, H., Avery, R., Caylor, K., Eastman, J., Fishgold, L., Friedl, M., Jain, M., Kohli, D., Laso Bayas, J., Lunga, D., McCarty, J., Pontius, R., Reinmann, A., Rogan, J., Song, L., Stoynova, H., Ye, S., Yi, Z.-F., Estes, L. (2020). Accounting for Training Data Error in Machine Learning Applied to Earth Observations. Remote Sensing, 12(6), 1034. https://doi.org/10.3390/rs12061034

[2] Alemohammad, H., Booth, K. (2020). LandCoverNet: A global benchmark land cover classification training dataset. NeurIPS 2020 Workshop on AI for Earth Sciences. http://arxiv.org/abs/2012.03
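
The abstract does not say how the consensus score enters the training. One common option, shown here only as an illustrative sketch, is to scale each pixel's cross-entropy loss by its consensus score, so that pixels on which annotators disagreed contribute less to the gradient. The function names, the assumption that scores lie in [0, 1], and the normalisation by total weight are all hypothetical choices made for this example, not details taken from the study.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_cross_entropy(pixel_logits, labels, consensus_scores):
    """Cross-entropy averaged over pixels, weighted by consensus score.

    pixel_logits:     one list of class logits per pixel
    labels:           one integer consensus label per pixel
    consensus_scores: one float per pixel, assumed to lie in [0, 1]
                      (hypothetical use of the score as a loss weight)
    """
    if not (len(pixel_logits) == len(labels) == len(consensus_scores)):
        raise ValueError("inputs must have one entry per pixel")

    weighted_sum = 0.0
    weight_total = 0.0
    for logits, label, score in zip(pixel_logits, labels, consensus_scores):
        prob_of_label = softmax(logits)[label]
        weighted_sum += score * -math.log(prob_of_label)
        weight_total += score

    # Divide by the total weight so the loss scale does not shrink
    # simply because many pixels have low consensus scores.
    return weighted_sum / weight_total if weight_total > 0 else 0.0


# Two pixels with identical predictions; the second label disagrees
# with the prediction and also has a low consensus score.
logits = [[2.0, 0.0], [2.0, 0.0]]
labels = [0, 1]
unweighted = weighted_cross_entropy(logits, labels, [1.0, 1.0])
down_weighted = weighted_cross_entropy(logits, labels, [1.0, 0.1])
```

In this toy case `down_weighted` is smaller than `unweighted`, because the low-consensus pixel is penalised less. Setting every score to 1.0 recovers the ordinary mean cross-entropy, which corresponds to the baseline trained without uncertainty information.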
