Abstract
We examine the intersection of the FAIR principles (Findable, Accessible, Interoperable and Reusable), the challenges and opportunities presented by the aggregation of widely distributed and heterogeneous data about biological and geological specimens, and the use of the Digital Object Architecture (DOA) data model and components as an approach to solving those challenges that offers adherence to the FAIR principles as an integral characteristic. This approach will be prototyped in the Distributed System of Scientific Collections (DiSSCo) project, the pan-European Research Infrastructure which aims to unify over 110 natural science collections across 21 countries. We take each of the FAIR principles, discuss them as requirements in the creation of a seamless virtual collection of bio/geo specimen data, and map those requirements to Digital Object components and facilities such as persistent identification, extended data typing, and the use of an additional level of abstraction to normalize existing heterogeneous data structures. The FAIR principles inform and motivate the work and the DO Architecture provides the technical vision to create the seamless virtual collection vitally needed to address scientific questions of societal importance.
Highlights
For hundreds of years, scientists have collected and studied plants, animals, rocks, minerals and fossils from our planet
CHALLENGES d A true virtual collection resource requires that data it holds must be findable, accessible, interoperable and reusable
Interoperable: The point of building a virtual collection of distributed data resources is to treat those as d a unified scientific asset – to be able to find and access data across the combined set, and to be able to re-combine and/or otherwise compute across the data to develop new knowledge and test the old
Summary
Scientists have collected and studied plants, animals, rocks, minerals and fossils from our planet. CHALLENGES d A true virtual collection resource requires that data it holds must be findable, accessible, interoperable and reusable. O Findable: The first requirement in building digital research infrastructures for bio- and geo-diversity is the C ability to find relevant resources, starting with the digital specimens and digital collections that anchor the information facet of infrastructure.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have