Big Data Provenance: Challenges, State of the Art and Opportunities.

Jianwu Wang,Daniel Crawl,Mai Nguyen,Shweta Purawat,Ilkay Altintas

doi:10.1109/bigdata.2015.7364047

Abstract

Ability to track provenance is a key feature of scientific workflows to support data lineage and reproducibility. The challenges that are introduced by the volume, variety and velocity of Big Data, also pose related challenges for provenance and quality of Big Data, defined as veracity. The increasing size and variety of distributed Big Data provenance information bring new technical challenges and opportunities throughout the provenance lifecycle including recording, querying, sharing and utilization. This paper discusses the challenges and opportunities of Big Data provenance related to the veracity of the datasets themselves and the provenance of the analytical processes that analyze these datasets. It also explains our current efforts towards tracking and utilizing Big Data provenance using workflows as a programming model to analyze Big Data.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Big Data Provenance: Challenges, State of the Art and Opportunities.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data

Lead the way for us

Journal: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data	Publication Date: Oct 1, 2015
Citations: 92

Similar Papers

Big data provenance and analytics in telecom contact centers
S D Madhu Kumar ... T Radha Ramanan
-
S D Madhu Kumar, et. al.S D Madhu Kumar ... T Radha Ramanan
01 Nov 2017
01 Nov 2017

Big Data Quality for Data Mining in Business Intelligence Applications
Arun Thotapalli Sundararaman
-
Arun Thotapalli SundararamanArun Thotapalli Sundararaman
01 Jan 2020
01 Jan 2020

An Hybrid Approach to Quality Evaluation across Big Data Value Chain
Ikbal Taleb ... Hadeel T El Kassabi
-
Ikbal Taleb, et. al.Ikbal Taleb ... Hadeel T El Kassabi
01 Jun 2016
01 Jun 2016

Preface
-
Journal of Physics: Conference Series | VOL. 2179
--
01 Jan 2021
Journal of Physics: Conference Series | VOL. 2179

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Big Data Provenance: Challenges, State of the Art and Opportunities.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data