Impact of Training Set Size on the Ability of Deep Neural Networks to Deal with Omission Noise

Jonas Gütter,Julia Niebling,Anna Kruspe,Xiao Xiang Zhu

doi:10.3389/frsen.2022.932431

Jonas Gütter, Julia Niebling + Show 2 more

Open Access

https://doi.org/10.3389/frsen.2022.932431

Copy DOI

Abstract

Deep Learning usually requires large amounts of labeled training data. In remote sensing, deep learning is often applied for land cover and land use classification as well as street network and building segmentation. In case of the latter, a common way of obtaining training labels is to leverage crowdsourced datasets which can provide numerous types of spatial information on a global scale. However, labels from crowdsourced datasets are often limited in the sense that they potentially contain high levels of noise. Understanding how such noisy labels impede the predictive performance of Deep Neural Networks (DNNs) is crucial for evaluating if crowdsourced data can be an answer to the need for large training sets by DNNs. One way towards this understanding is to identify the factors which affect the relationship between label noise and predictive performance of a model. The size of the training set could be one of these factors since it is well known for being able to greatly influence a model’s predictive performance. In this work we pick the size of the training set and study its influence on the robustness of a model against a common type of label noise known as omission noise. To this end, we utilize a dataset of aerial images for building segmentation and create several versions of the training labels by introducing different amounts of omission noise. We then train a state-of-the-art model on subsets of varying size of those versions. Our results show that the training set size does play a role in affecting the robustness of our model against label noise: A large training set improves the robustness of our model against omission noise.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Remote Sensing	Publication Date: Jul 6, 2022
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Impact of Training Set Size on the Ability of Deep Neural Networks to Deal with Omission Noise

Abstract

Talk to us

Similar Papers

More From: Frontiers in Remote Sensing

Lead the way for us

Similar Papers

Choosing an appropriate training set size when using existing data to train neural networks for land cover segmentation
Huan Ning ... Lina Yang
Annals of GIS | VOL. 26
Huan Ning, et. al.Huan Ning ... Lina Yang
10 Aug 2020
Annals of GIS | VOL. 26

The use of on-line co-training to reduce the training set size in pattern recognition methods: Application to left ventricle segmentation in ultrasound
G Carneiro ... J C Nascimento
-
G Carneiro, et. al.G Carneiro ... J C Nascimento
01 Jun 2012
01 Jun 2012

Exploring the impact of size of training sets for the development of predictive QSAR models
Partha Pratim Roy ... Kunal Roy
Chemometrics and Intelligent Laboratory Systems | VOL. 90
Partha Pratim Roy, et. al.Partha Pratim Roy ... Kunal Roy
07 Aug 2007
Chemometrics and Intelligent Laboratory Systems | VOL. 90

Limited Generalization Capabilities of Autoencoders with Logistic Regression on Training Sets of Small Sizes
Alexey Potapov ... Maxim Peterson
-
Alexey Potapov, et. al.Alexey Potapov ... Maxim Peterson
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Impact of Training Set Size on the Ability of Deep Neural Networks to Deal with Omission Noise

Abstract

Talk to us

Similar Papers

More From: Frontiers in Remote Sensing