Building Trust in Earth Science Findings through Data Traceability and Results Explainability

Paula Olaya, Leobardo Valera, Rodrigo Vargas, Jay Lofstead, Ricardo Llamas, Michela Taufer, Dominic Kennedy

doi:10.1109/tpds.2022.3220539

Abstract

To trust findings in computational science, scientists need workflows that trace the data provenance and support results explainability. As workflows become more complex, tracing data provenance and explaining results become harder to achieve. In this paper, we propose a computational environment that automatically creates a workflow execution's record trail and invisibly attaches it to the workflow's output, enabling data traceability and results explainability. Our solution transforms existing container technology, includes tools for automatically annotating provenance metadata, and allows effective movement of data and metadata across the workflow execution. We demonstrate the capabilities of our environment with the study of SOMOSPIE, an earth science workflow. Through a suite of machine learning modeling techniques, this workflow predicts soil moisture values from the 27 km resolution satellite data down to higher resolutions necessary for policy making and precision agriculture. By running the workflow in our environment, we can identify the causes of different accuracy measurements for predicted soil moisture values in different resolutions of the input data and link different results to different machine learning methods used during the soil moisture downscaling, all without requiring scientists to know aspects of workflow design and implementation.

Journal: IEEE Transactions on Parallel and Distributed Systems	Publication Date: Feb 1, 2023
Citations: 6	License type: CC BY-NC-ND 4.0

Similar Papers

Evaluating Machine Learning and Geostatistical Methods for Spatial Gap-Filling of Monthly ESA CCI Soil Moisture in China
Hao Sun ... Qian Xu
Remote Sensing | VOL. 13
20 Jul 2021

Spatial downscaling of SMAP soil moisture to high resolution using machine learning over China’s Loess Plateau
Ye Wang ... Li Li
Catena | VOL. 247
24 Oct 2024

Downscaling and validating SMAP soil moisture using a machine learning algorithm over the Awash River basin, Ethiopia.
Shimelis Sishah ... Claudionor Ribeiro Da Silva
PloS one | VOL. 18
13 Jan 2023

Watershed scale soil moisture estimation model using machine learning and remote sensing in a data-scarce context
Marcelo Bueno Dueñas ... Hildo Loayza
Scientia Agropecuaria | VOL. 15
11 Mar 2024
