A study of genomic data provenance in NoSQL document-oriented database systems

Valeria Guimaraes,Sergio Lifschitz,Maristela Holanda,Aleteia Araujo,Fernanda Hondo,Rodrigo Almeida,Harley Vera,Maria Emilia Walter

doi:10.1109/bibm.2015.7359902

A study of genomic data provenance in NoSQL document-oriented database systems

Valeria Guimaraes, Sergio Lifschitz + Show 6 more

https://doi.org/10.1109/bibm.2015.7359902

Copy DOI

Publication Date: Nov 1, 2015

Citations: 10

Affiliation: Universidade de Brasília

#Workflow Execution #Details Of Execution + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

This work considers a scientific experiment as a computational workflow. Provenance models store details of each workflow execution, including produced data, computational tools parameters and their versions, among others. This way, scientists can review details of a particular workflow execution, compare information generated among different executions and plan new ones efficiently. In the bioinformatics domain, particularly in the presence of large volumes of data, persistency of those data generated during the workflow execution is still a research challenge. In this article, we consider a study on provenance data storage for bioinformatics in a document-oriented NoSQL database system. We present data modeling issues and discuss an actual implementation into MongoDB.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.