Abstract

To address the need for published data, considerable effort has gone into formalizing the process of data publication. From funding agencies to publishers, data publication has rapidly become a requirement. Digital Object Identifiers (DOI) and data citations have enhanced the integration and availability of data. The challenge facing data publishers now is to deal with the increased number of publishable data products and most importantly the difficulties of publishing diverse data products into an online archive. The Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC), a NASA-funded data center, faces these challenges as it deals with data products created by individual investigators. This paper summarizes the challenges of curating data and provides a summary of a workflow solution that ORNL DAAC researcher and technical staffs have created to deal with publication of the diverse data products. The workflow solution presented here is generic and can be applied to data from any scientific domain and data located at any data center.

Highlights

  • Up until the early 1990s, terrestrial ecology data publication comprised primarily of graphs, tables, and figures included in published manuscripts

  • A workflow for data ingest of the “soft” copy version of the data has been developed at the Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC) that is based on practices for data archival that formalizes interactions with users, compiles information, data files, and metadata, and releases the product to the public

  • This paper summarizes the challenges of curating data produced by the terrestrial ecology community and provides a workflow solution to efficiently archive diverse data products generated by researchers

Read more

Summary

A Semi-Automated Workflow Solution for Data Set Publication

Academic Editors: Constanze Curdt, Christian Willmes, Georg Bareth and Wolfgang Kainz Received: 20 December 2015; Accepted: 25 February 2016; Published: 8 March 2016

Introduction
ORNL DAAC
Data Ingest—Essential 5Ps
Data “Deluge” and Diversity—ORNL DAAC Case Study
Data Provider Interactions
ORNL DAAC Curation
Lessons Learned in using the SAuS Workflow
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call