Abstract

For over 50 years, cores recovered from ocean basins have generated fossil, lithologic, and chemical archives that have revolutionized fields within the earth sciences. Although scientific ocean drilling (SOD) data are openly available following each expedition, the formats for these data are heterogeneous. Furthermore, lithological, chronological, and paleobiological data are typically separated into different repositories, limiting researchers' abilities to discover and analyze integrated SOD data sets. Emphasis within Earth Sciences on Findable, Accessible, Interoperable, and Reusable (FAIR) Data Principles and the establishment of community‐led databases provide a pathway to unite SOD data and further harness the scientific potential of the investments made in offshore drilling. Here, we describe a workflow for compiling, cleaning, and standardizing key SOD records, and importing them into the Paleobiology Database and Macrostrat, systems with versatile, open data distribution mechanisms. These efforts are being carried out by the extending Ocean Drilling Pursuits (eODP) project. eODP has processed all of the lithological, chronological, and paleobiological data from one SOD repository, along with numerous other data sets that were never deposited in a database; these were manually transcribed from original reports. This compiled data set contains over 79,899 lithological units from 1,125 drilling holes from 422 sites. Over 26,000 fossil‐bearing samples, with 5,378 taxonomic entries from 13 biological groups, are placed within this lithologic spatiotemporal framework. All information is available via GitHub and Macrostrat's application programming interface, which renders data retrievable by a variety of parameters, including age, site, and lithology.