FAIR.ReD: Semantic knowledge graph infrastructure for the life sciences

Lars Vogt,Peter Grobe,Pier Luigi Buttigieg,Ricardo Usbeck,Sören Auer,Thomas Bartolomaeus,Markus Stocker,Peter Michalik

doi:10.3897/biss.3.37206

Lars Vogt, Peter Grobe + Show 6 more

Open Access

https://doi.org/10.3897/biss.3.37206

Copy DOI

Journal: Biodiversity Information Science and Standards	Publication Date: Jun 19, 2019
Citations: 1	License type: CC BY 4.0

Abstract

We would like to present FAIR Research Data: Semantic Knowledge Graph Infrastructure for the Life Sciences (in short, FAIR.ReD), a project initiative that is currently being evaluated for funding. FAIR.ReD is a software environment for developing data management solutions according to the FAIR (Findable, Accessible, Interoperable, Reusable; Wilkinson et al. 2016) data principles. It utilizes what we call a Data Sea Storage, which employs the idea of Data Lakes to decouple data storage from data access but modifies it by storing data in a semantically structured format as either semantic graphs or semantic tables, instead of storing them in their native form. Storage follows a top-down approach, resulting in a standardized storage model, which allows sharing data across all FAIR.ReD Knowledge Graph Applications (KGAs) connected to the same Sea, with newly developed KGAs having automatically access to all contents in the Sea. In contrast access and export of data follows a bottom-up approach that allows the specification of additional data models to meet the varying domain-specific and programmatic needs for accessing structured data. The FAIR.ReD engine enables bidirectional data conversion between the two storage models and any additional data model, which will substantially reduce conversion workload for data-rich institutes (Fig. 1). Moreover, with the possibility to store data in semantic tables, FAIR.ReD provides high performance storage for incoming data streams such as sensory data. FAIR.ReD KGAs are modularly organized. Modules can be edited using the FAIR.ReD editor and combined to form coherent KGAs. The editor allows domain experts to develop their own modules and KGAs without any programming experience required, thus also allowing smaller projects and individual researchers to build their own FAIR data management solution. Contents from FAIR.ReD KGAs can be published under a Creative Commons license as documents, micropublications, or nanopublications, each receiving their own DOI. A publication-life-cycle is implemented in FAIR.ReD and allows updating published contents for corrections or additions without overwriting the originally published version. Together with the fact that data and metadata are semantically structured and machine-readable, all contents from FAIR.ReD KGAs will comply with the FAIR Guiding Principles. Due to all FAIR.Red KGAs providing access to semantic knowledge graphs in both a human-readable and a machine-readable version, FAIR.ReD seamlessly integrates the complex RDF (Resource Description Framework) world with a more intuitively comprehensible presentation of data in form of data entry forms, charts, and tables. Guided by use cases, the FAIR.ReD environment will be developed using semantic programming where the source code of an application is stored in its own ontology. The set of source code ontologies of a KGA and its modules provides the steering logic for running the KGA. With this clear separation of steering logic from interpretation logic, semantic programming follows the idea of separating main layers of an application, analog to the separation of interpretation logic and presentation logic. Each KGA and module is specified exactly in this way and their source code ontologies stored in the Data Sea. Thus, all data and metadata are semantically transparent and so is the data management application itself, which substantially improves their sustainability on all levels of data processing and storing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FAIR.ReD: Semantic knowledge graph infrastructure for the life sciences

Abstract

Talk to us

Similar Papers

More From: Biodiversity Information Science and Standards

Lead the way for us

Similar Papers

From Data to Knowledge: A semantic knowledge graph application for curating specimen data
Peter Grobe ... Christian Köhler
Biodiversity Information Science and Standards | VOL. 3
Peter Grobe, et. al.Peter Grobe ... Christian Köhler
26 Jun 2019
Biodiversity Information Science and Standards | VOL. 3

Using Named Graphs and Knowledge Graph Template Patterns for Efficiently Organizing FAIR Anatomy Data and Metadata
Lars Vogt ... Roman Baum
Biodiversity Information Science and Standards | VOL. 3
Lars Vogt, et. al.Lars Vogt ... Roman Baum
19 Jun 2019
Biodiversity Information Science and Standards | VOL. 3

Technical Note: Ontology-guided radiomics analysis workflow (O-RAW).
Zhenwei Shi ... Johan Van Soest
Medical Physics | VOL. 46
Zhenwei Shi, et. al.Zhenwei Shi ... Johan Van Soest
25 Oct 2019
Medical Physics | VOL. 46

Research Data Management and Data Stewardship Competences in University Curriculum
Yuri Demchenko ... Lennart Stoy
-
Yuri Demchenko, et. al.Yuri Demchenko ... Lennart Stoy
21 Apr 2021
21 Apr 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FAIR.ReD: Semantic knowledge graph infrastructure for the life sciences

Abstract

Talk to us

Similar Papers

More From: Biodiversity Information Science and Standards