Abstract
Provenance in the context of workflows, both for their specification and for the data they derive, is essential for result reproducibility, sharing, and knowledge reuse in the scientific community. The information models to capture the provenance of scientific workflows, known as provenance models, have been of wide interest and studied in many fields, including the semantic Web. However, most existing provenance models rely on commonly perceived data-driven execution paradigm of scientific workflows and overlook control constructs that may appear in scientific workflows. The provenance models proposed by the semantic Web community for data-driven scientific workflows underspecify the structure of the workflows (i.e., workflow provenance). Such an underspecified workflow structure can result in the misinterpretation of a scientific experiment and precludes its conformance checking, thus restricting provenance gains.This paper shows that the design of a provenance model for control-flow constructs and the means to integrate it with the existing provenance models can maximise the provenance gains for the users of scientific workflows. In this regard, first, we specify the need for a control-flow driven scientific workflow provenance model and detail the minimal characteristics which such a model should address. Secondly, we present a formal model to specify the control-flows that may appear in scientific workflows and describe how the existing provenance models can be complemented by using the proposed model. Finally, we show that the proposed model can capture accurate provenance information for the scientific workflows, which makes it possible to understand, reproduce and validate workflows and their output.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.