Abstract

Being able to assess the quality and level of completeness of data has become indispensable in marine biodiversity research, especially when dealing with large databases that typically compile data from a variety of sources. Very few integrated databases offer quality flags on the level of the individual record, making it hard for users to easily extract the data that are fit for their specific purposes. This article describes the different steps that were developed to analyse the quality and completeness of the distribution records within the European and international Ocean Biogeographic Information Systems (EurOBIS and OBIS). Records are checked on data format, completeness and validity of information, quality and detail of the used taxonomy and geographic indications and whether or not the record is a putative outlier. The corresponding quality control (QC) flags will not only help users with their data selection, they will also help the data management team and the data custodians to identify possible gaps and errors in the submitted data, providing scope to improve data quality. The results of these quality control procedures are as of now available on both the EurOBIS and OBIS databases. Through the Biology portal of the European Marine Observation and Data Network (EMODnet Biology), a subset of EurOBIS records—passing a specific combination of these QC steps—is offered to the users. In the future, EMODnet Biology will offer a wide range of filter options through its portal, allowing users to make specific selections themselves. Through LifeWatch, users can already upload their own data and check them against a selection of the here described quality control procedures.Database URL: www.eurobis.org (www.iobis.org; www.emodnet-biology.eu/)

Highlights

  • Progress in information technology has resulted in an increasing flood of data and information

  • The results show that—on average—85% of distribution records in Ocean Biogeographic Information System (OBIS) and its respective nodes can be used for species or genus specific analyses (Figure 1)

  • The development and implementation of the described quality control (QC) steps meets a need to be able to add quality flags to records and to filter out data based on user needs, taking into account the fitness for purpose of the available records

Read more

Summary

Introduction

Progress in information technology has resulted in an increasing flood of data and information. Evaluating and documenting the quality of data has already become a standard practice in several scientific disciplines over many years, e.g. in medicine [1,2,3,4], remote sensing [5,6,7] and gene sequencing [8,9,10] It is only in the last decade that its importance—in combination with the assessment of the fitness for use—has become evident for biological sciences, for biodiversity data and data related to species occurrences [11,12,13,14,15]. Both concepts are essential in research hypotheses, and in the field of conservation, management [16, 21, 22] and modelling [23,24,25]

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call