Abstract
Abstract. Training Deep Learning (DL) algorithms for segmenting features require hundreds to thousands of input data and corresponding labels. Generating thousands of input images and labels requires considerable resources and time. Hence, it is common practice to use opensource imagery data and labels available online. Most of these open-source data have little or no metadata describing their quality or suitability making it problematic for training or evaluating DL models. This study evaluated the effect of data quality on training DeepLabV3+, using Sentinel 2 A/B RGB images and labels obtained from Kaggle. We generated subsets of 256 × 256 pixels, and 10% of these images (802) were set aside for testing. First, we trained and validated the DeepLabV3+ model with the remaining images. Second, we removed images with incorrect labels and trained another DeepLabV3+ network. Finally, we trained the third DeepLabV3+ network after removing images with turbid water or with floating vegetation. All three trained models were evaluated with test images and then we calculated accuracy metrics. As the quality of the input images improved, accuracy of the predicted masks generated from the first model increased from 92.8% to 94.3% in the second model. The third model’s accuracy was 96.4%, demonstrating the network’s ability to better learn and predict water bodies when the input data had fewer class variations. Based on the results we recommend assessing the quality of open-source data for incorrect labels and variations in the target class prior to training DeepLabV3+ or any other DL network.
Full Text
Topics from this Paper
Training Deep Learning
Incorrect Labels
Training Deep Learning Algorithms
Deep Learning Network
Deep Learning
+ Show 5 more
Create a personalized feed of these topics
Get StartedTalk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Similar Papers
iScience
Apr 1, 2022
Physics in Medicine & Biology
Feb 1, 2022
Journal of Petroleum Exploration and Production Technology
Jul 11, 2022
RadioGraphics
Apr 1, 2023
Frontiers in Psychiatry
Jul 9, 2021
Oct 8, 2021
Jun 6, 2021
Sensors (Basel, Switzerland)
Jul 2, 2022
Frontiers in plant science
May 31, 2023
Wireless Communications and Mobile Computing
Mar 7, 2022
Nov 1, 2020
Agriculture
Jul 6, 2022
Physica Medica
Mar 1, 2021
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Oct 19, 2023