Abstract

prov-template is a declarative approach that enables designers and programmers to design and generate provenance compatible with the prov standard of the World Wide Web Consortium. Designers specify the topology of the provenance to be generated by composing templates, which are provenance graphs containing variables, acting as placeholders for values. Programmers write programs that log values and package them up in sets of bindings, a data structure associating variables and values. An expansion algorithm generates instantiated provenance from templates and sets of bindings in any of the serialisation formats supported by prov . A quantitative evaluation shows that sets of bindings have a size that is typically 40 percent of that of expanded provenance templates and that the expansion algorithm is suitably tractable, operating in fractions of milliseconds for the type of templates surveyed in the article. Furthermore, the approach shows four significant software engineering benefits: separation of responsibilities, provenance maintenance, potential runtime checks and static analysis, and provenance consumption. The article gathers quantitative data and qualitative benefits descriptions from four different applications making use of prov-template . The system is implemented and released in the open-source library ProvToolbox for provenance processing.

Highlights

  • PROVENANCE has gained a lot of traction lately in various areas including the Web, legal notices,1 climate science,2 scientific workflows [1], [2], [3], computational reproducibility [4], emergency response [5], medical applications,3 geospatial domain,4 art and food

  • A potential challenge with the above techniques is that, while the programmer’s task is facilitated because there is no requirement to program the topology of provenance graphs, a potential new source of error comes with variables names, and the burden of ensuring that they correspond to the variables occurring in the templates

  • We have presented PROV-TEMPLATE, a practical approach that facilitates the generation of provenance

Read more

Summary

A Templating System to Generate Provenance

Luc Moreau , Belfrit Victor Batlajery, Trung Dong Huynh, Danius Michaelides, and Heather Packer. Abstract—PROV-TEMPLATEis a declarative approach that enables designers and programmers to design and generate provenance compatible with the PROV standard of the World Wide Web Consortium. Programmers write programs that log values and package them up in sets of bindings, a data structure associating variables and values. An expansion algorithm generates instantiated provenance from templates and sets of bindings in any of the serialisation formats supported by PROV. A quantitative evaluation shows that sets of bindings have a size that is typically 40 percent of that of expanded provenance templates and that the expansion algorithm is suitably tractable, operating in fractions of milliseconds for the type of templates surveyed in the article. The article gathers quantitative data and qualitative benefits descriptions from four different applications making use of PROV-TEMPLATE. The system is implemented and released in the open-source library ProvToolbox for provenance processing

INTRODUCTION
PROVENANCE APPLICATIONS AND EXAMPLE
An Example of Template
Bindings and Template Expansion
The Difficulty of Generating Provenance without Template
ARCHITECTURAL OVERVIEW
Template Definition
Simple Set of Bindings
Simple Name Replacing in Templates
Linked Names
Complex Sets of Bindings and Template Expansion
QUANTITATIVE EVALUATION
BINDINGS GENERATION
Abstract Bindings Creation
Bindings Fragments
Support for Checks
Concrete Bindings Creation
PRACTICAL EXPERIENCE
Benefit 1
Benefit 2
Benefit 3
Benefit 4
RELATED WORK
Coarse-Grained and Fine-Grained Provenance
Provenance
Provenance Templates and Views
Findings
CONCLUSION AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call