Enriched biodiversity data as a resource and service.

Jordan Biserkov,Rutger A Vos ,David King,Thomas Hamann,David Shorthouse,Alan Williams,Robert Hoehndorf,Tom Van Dooren,Patricia Kelbert,Youri Lammers,Daniel Mietchen,Bachir Balech,Aleksandra Pawlik,Ayco Holleman,Marko Tähtinen,Hannes Hettling,Serrano Pereira,Quentin Groom,Claus Weiland,Don Kirkup,Matthew Blissett,Thibaut Demeulemeester,Christian Brenninkmeijer,Ross Mounce,Niall Beard,Lyubomir Penev,Soraya Sierra,George Gosline,Guido Sautter,Nicky Nicolson ,Jeremy A Miller ,P Hovenkamp ,David C Eades ,Roderic Page ,K Andrew R Richards

doi:10.3897/bdj.2.e1125

Abstract

Background: Recent years have seen a surge in projects that produce large volumes of structured, machine-readable biodiversity data. To make these data amenable to processing by generic, open source “data enrichment” workflows, they are increasingly being represented in a variety of standards-compliant interchange formats. Here, we report on an initiative in which software developers and taxonomists came together to address the challenges and highlight the opportunities in the enrichment of such biodiversity data by engaging in intensive, collaborative software development: The Biodiversity Data Enrichment Hackathon.Results: The hackathon brought together 37 participants (including developers and taxonomists, i.e. scientific professionals that gather, identify, name and classify species) from 10 countries: Belgium, Bulgaria, Canada, Finland, Germany, Italy, the Netherlands, New Zealand, the UK, and the US. The participants brought expertise in processing structured data, text mining, development of ontologies, digital identification keys, geographic information systems, niche modeling, natural language processing, provenance annotation, semantic integration, taxonomic name resolution, web service interfaces, workflow tools and visualisation. Most use cases and exemplar data were provided by taxonomists.One goal of the meeting was to facilitate re-use and enhancement of biodiversity knowledge by a broad range of stakeholders, such as taxonomists, systematists, ecologists, niche modelers, informaticians and ontologists. The suggested use cases resulted in nine breakout groups addressing three main themes: i) mobilising heritage biodiversity knowledge; ii) formalising and linking concepts; and iii) addressing interoperability between service platforms. Another goal was to further foster a community of experts in biodiversity informatics and to build human links between research projects and institutions, in response to recent calls to further such integration in this research domain.Conclusions: Beyond deriving prototype solutions for each use case, areas of inadequacy were discussed and are being pursued further. It was striking how many possible applications for biodiversity data there were and how quickly solutions could be put together when the normal constraints to collaboration were broken down for a week. Conversely, mobilising biodiversity knowledge from their silos in heritage literature and natural history collections will continue to require formalisation of the concepts (and the links between them) that define the research domain, as well as increased interoperability between the software platforms that operate on these concepts.

Highlights

The Royal Society report on “Science as an Open Enterprise” recommends that “scientists should communicate the data they collect and the models they create, to allow free and open access, and in ways that are intelligible, accessible and usable for other specialists in the same or linked fields wherever they are in the world” (Boulton et al 2012)
Mobilising biodiversity knowledge from their silos in heritage literature and natural history collections will continue to require formalisation of the concepts that define the research domain, as well as increased interoperability between the software platforms that operate on these concepts
All hackathon participants had a priori indicated their agreement with open source licensing of all code and documentation produced at the hackathon, and so we present the tangible outcomes of these breakout groups for the benefit of the larger community

Summary

Results

Accomplishments - The advent of biodiversity data aggregators such as Plazi (Agosti and Egloff 2009) have made it possible to perform analytics on biodiversity data use and reuse. The adoption of the design pattern has considerably lowered the barrier for developers to implement and deploy simple services, it poses challenges to clients and end users, as the way in which they are expected to interact with these heterogeneous service interfaces is not always documented clearly Examples of this include the various “RESTful” as well as SOAP/WSDL-based web services delivered to the BioVeL project (Vicario et al 2012). One of the deliverables of the BioVeL project (Vicario et al 2011) is a workflow “player” that can be embedded in HTML, exposing the functionality of a given Taverna workflow within a web page This breakout group sought to combine these two tools, such that iPython notebook users are able to submit data assembled during their analysis to a Taverna workflow embedded in a player and retrieving the results.

Conclusions

Introduction

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biodiversity Data Journal	Publication Date: Jun 16, 2014
Citations: 17	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Enriched biodiversity data as a resource and service.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biodiversity Data Journal

Lead the way for us

Similar Papers

BioGeomancer: Automated Georeferencing to Map the World's Biodiversity Data
Robert P Guralnick ... Reed Beaman
PLoS Biology | VOL. 4
Robert P Guralnick, et. al.Robert P Guralnick ... Reed Beaman
01 Nov 2006
PLoS Biology | VOL. 4

Bringing Together Species Observations: A Case Story of Sweden’s Biodiversity Informatics Infrastructures
Jesse D Peterson ... René Van Der Wal
Minerva | VOL. 61
Jesse D Peterson, et. al.Jesse D Peterson ... René Van Der Wal
03 Apr 2023
Minerva | VOL. 61

From Raw Biodiversity Data to Indicators, Boosting Products Creation, Integration and Dissemination: French BON FAIR initiatives and related informatics solutions
Yvan Le Bras ... Aurélie Delavaud
Biodiversity Information Science and Standards | VOL. 3
Yvan Le Bras, et. al.Yvan Le Bras ... Aurélie Delavaud
20 Aug 2019
Biodiversity Information Science and Standards | VOL. 3

Biodiversity Informatics: the emergence of a field
Indra Neil Sarkar
BMC Bioinformatics | VOL. 10
Indra Neil SarkarIndra Neil Sarkar
01 Nov 2009
BMC Bioinformatics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enriched biodiversity data as a resource and service.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biodiversity Data Journal