A comprehensive record of research data provenance is essential for the successful curation, management, and reuse of data over time. However, creating such detailed metadata can be onerous, and there are few structured methods for doing so. In this case study of data curation in support of geobiology research conducted at Yellowstone National Park, we describe a method of “Research Process Modeling” for documenting noncomputational data provenance in a structured yet flexible way. The method combines systems analysis techniques to model research activities, the World Wide Web Consortium Provenance (PROV) ontology to illustrate relationships between data products, and simple inventory methods to account for research processes and data products. It also supports collaborative data curation between information professionals and researchers, and is therefore a significant step toward producing more useable and interpretable research data. We demonstrate how this method describes data provenance more robustly than “flat” metadata alone and fills a critical gap in the documentation of provenance for field‐based and noncomputational workflows. We discuss potential applications of this approach to other research domains.
Read full abstract