A checklist recipe: making species data open and FAIR.

Lien Reyserhove,Damiano Oldoni,Quentin Groom,Tim Adriaens,Sonia Vanderhoeven,Peter Desmet,Diederik Strubbe,Filip Verloove,Amy J S Davis

doi:10.1093/database/baaa084

Abstract

Species checklists are a crucial source of information for research and policy. Unfortunately, many traditional species checklists vary wildly in their content, format, availability and maintenance. The fact that these are not open, findable, accessible, interoperable and reusable (FAIR) severely hampers fast and efficient information flow to policy and decision-making that are required to tackle the current biodiversity crisis. Here, we propose a reproducible, semi-automated workflow to transform traditional checklist data into a FAIR and open species registry. We showcase our workflow by applying it to the publication of the Manual of Alien Plants, a species checklist specifically developed for the Tracking Invasive Alien Species (TrIAS) project. Our approach combines source data management, reproducible data transformation to Darwin Core using R, version control, data documentation and publication to the Global Biodiversity Information Facility (GBIF). This checklist publication workflow is openly available for data holders and applicable to species registries varying in thematic, taxonomic or geographical scope and could serve as an important tool to open up research and strengthen environmental decision-making.

Highlights

Despite the numerous organizations investing in biodiversity data gathering, it is recognized that valuable data can often not be fully utilized or reused [1, 2]
The end product of the checklist publication workflow is a dataset that is openly available and complies with the FAIR principles. It is ‘Findable’ by its globally unique and persistent identifier (DOI, Figure 3F), described with rich metadata (Figure 3G) and registered in Global Biodiversity Information Facility (GBIF) (Figure 3A), ‘Accessible’ by clicking on the download link provided in GBIF (Figure 3B), ‘Interoperable’ as it uses a broadly applicable biodiversity standard and vocabularies provided by TDWG and GBIF (Figure 3D, H), ‘Reusable’ as it is associated with detailed provenance (Figure 3C) and released with a clear data usage license: the open Creative Commons license (Figure 3E)
The GBIF Integrated Publishing Toolkit (IPT) allows for version control of the published data and Google Docs allows for version control of metadata documents

Summary

Introduction

Despite the numerous organizations investing in biodiversity data gathering, it is recognized that valuable data can often not be fully utilized or reused [1, 2]. To publish the checklist on GBIF, metadata needs to conform to the GBIF Metadata Profile (GMP), an extension of Ecological Metadata Language (EML) [23]: a standard to record information about ecological datasets in XML This profile includes information related to the publisher, authors, keywords and geographic, taxonomic and temporal scope of the dataset, as well as project and sampling information, the latter of which can be used to document source data provenance and data transformation workflow. The checklist is ready for publication once the source data have been standardized to DwC, the dataset documented with metadata, and both sufficiently reviewed by the authors This can be done by creating a checklist resource on an IPT, ideally one hosted by a trusted data hosting center (https://www.gbif.org/data-hosting). For scientists unfamiliar to version control with Git and GitHub, see Blischak et al [25] for an introduction

Conclusion

Discussion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Database : the journal of biological databases and curation	Publication Date: Jan 1, 2020
Citations: 17	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A checklist recipe: making species data open and FAIR.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Database : the journal of biological databases and curation

Lead the way for us

Similar Papers

Assessing FAIRness of citizen science data in the context of the Green Deal Data Space
Victoria Lush ... J Masó
International Journal of Digital Earth | VOL. 17
Victoria Lush, et. al.Victoria Lush ... J Masó
09 May 2024
International Journal of Digital Earth | VOL. 17

Biodiversity Literature Repository: Building the customized FAIR repository by using custom metadata
Alexandros Ioannidis-Pantopikos ... Donat Agosti
Biodiversity Information Science and Standards | VOL. 5
Alexandros Ioannidis-Pantopikos, et. al.Alexandros Ioannidis-Pantopikos ... Donat Agosti
14 Sep 2021
Biodiversity Information Science and Standards | VOL. 5

A multi-omics data analysis workflow packaged as a FAIR Digital Object.
Alain J Van Gool ... Robert R J M Vermeiren
GigaScience | VOL. 13
Alain J Van Gool, et. al.Alain J Van Gool ... Robert R J M Vermeiren
02 Jan 2024
GigaScience | VOL. 13

From Raw Biodiversity Data to Indicators, Boosting Products Creation, Integration and Dissemination: French BON FAIR initiatives and related informatics solutions
Yvan Le Bras ... Jean-Baptiste Mihoub
Biodiversity Information Science and Standards | VOL. 3
Yvan Le Bras, et. al.Yvan Le Bras ... Jean-Baptiste Mihoub
20 Aug 2019
Biodiversity Information Science and Standards | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A checklist recipe: making species data open and FAIR.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Database : the journal of biological databases and curation