Resource Description Framework Research Articles

The Norwegian Biodiversity Information Centre (NBIC) is currently building a traitbank for Norwegian species. The purpose of the NBIC TraitBank is to enhance sharing of traits and other information about species to support conservation actions and ecological research. The traitbank will cover a subset of traits for all multicellular species and taxa that are found in Norway. Observations of traits, collected through citizen science as well as by experts, are connected to the NBIC TraitBank ontology, forming a knowledge base. We have modeled TraitBank’s ontology in accordance with NBIC’s data management requirements, focusing on the domain knowledge necessary for ontology-based data integration of internal databases and queries specified by use cases. Our initial steps in ontology construction included outlining competency questions, which are natural language sentences expressing the questions system users expect an ontology to answer (Bezerra et al. 2013, Ren et al. 2014). The ontology for the TraitBank is populated using expert input (manual entry) and through harvesting of traits from existing internal databases at NBIC. We will present our experiences with implementing Reasonable Ontology Templates (OTTR) (Skjæveland et al. 2021) as the means for modeling the ontology and populating the TraitBank ontology. OTTR is a language for formally representing and instantiating ontology modeling patterns and is designed to support knowledge base construction and interaction at a higher level of abstraction. In the case of the TraitBank, ontology patterns are edited and published using a Semantic MediaWiki (SMW) extension for OTTR (FloSchroeder 2022), thereby providing a tool for the domain expert to work directly with templates. We build the TraitBank ontology by instantiating the templates directly in SMW as wiki pages. We argue that templates are an effective means to support the integration and use of digital biodiversity data in transparent ways, leading to successful collaboration and reuse of data. Following the "Don't repeat yourself" (DRY) principle of software development (Hunt and Thomas 1999), OTTR templates lend themselves well to easier ontology maintenance, allowing updates to occur through changes in individual template definitions rather than to repeated statements spread throughout the ontology. OTTR reshapes how domain experts work with ontologies and the data connected to the knowledge base, lifting the biodiversity expert away from dealing directly with logical axioms and Web Ontology Language (OWL). The template libraries have the power to improve international collaboration, making it easier to exchange and reuse specific templates and suggest improvements. Our templates include mappings to standards developed by Biodiversity Information Standards (TDWG) and biodiversity-related ontologies, linking to the international community. Use of OTTR supports the principles of Findable, Accessible, Interoperable, and Reusable (FAIR) data and demonstrates a new technology that can support the creation of an extensive online network of knowledge. Example: Scientific Name The NBIC uses Scientific Name as the main identifier and means to track a species. The OTTR template shown below captures the NBIC’s modelling pattern for Scientific Name. The signature of the template specifies the Internationalized Resource Identifier (IRI) of the template (adb-t:ScientificName), and six parameters (where ?iri is the 1st parameter). The parameters are used in the body of the template and define how instances of the template are expanded to Resource Description Framework (RDF) statements. Template instance expansion is done in a recursive manner, similar to many macro programming languages. With the OTTR template definition given in Fig. 1, a template instance can be expanded, as shown in the example for Metopa glacialis Fig. 2. The benefits of using the OTTR framework is that modeling patterns are explicitly represented as an OTTR template, allowing for instances of patterns to be compactly and consistently captured. The format of template instances lends itself to instantiation from tabular data sources like spreadsheets and databases.

Read full abstract

Nowadays, more and more biodiversity datasets containing observational and experimental data are collected and produced by different projects. In order to answer the fundamental questions of biodiversity research, these data need to be integrated for joint analyses. However, to date, too often, these data remain isolated in silos. Both in academia and industry, Knowledge Graphs (KGs) are widely regarded as a promising approach to overcome issues of data silos and lack of common understanding of data (Fensel and Şimşek 2020). KGs are graph-structured knowledge bases that store factual information in the form of structured relationships between entities, like “tree_species has_trait average_SLA” or “nutans is_observed_in SCH_Location" (Hogan et al. 2021). In our context, entities could be, e.g., abstract concepts like a kingdom, a species, or a trait, or a concrete specimen of a species. Example relationships could be "co-occurs" or, "possesses-trait". KGs for biodiversity have been proposed by Page 2019 and have also been the topic at prior TDWG conferences *1 (Page 2021). However, to date, uptake of this concept in the community has been rather slow (Sachs et al. 2019). We argue that this is at least partially due to the high effort and expertise required in developing and managing such KGs. Therefore, in our ongoing project, iKNOW (Babalou et al. 2021), we aim to provide a toolbox for reproducible KG creation. While iKNOW is still in an early stage, we aim to make this platform open-source and freely available to the biodiversity community. Thus, it can significantly contribute to making biodiversity data widely available, easily discoverable, and integratable. For now, we focus on tabular datasets resulting from biodiversity observation or sampling events or experiments. Given such a dataset, iKNOW will support its transformation into (subject, predicate, object) triples in the RDF standard (Resource Description Framework). Every uploaded dataset will be considered as a subgraph of the main KG in iKNOW. If required, data can be cleaned. After that, the entities and relationships among them should be extracted. For that, a user will be able select one of the existing semi-automatic tools available on our platform (e.g., JenTab (Abdelmageed and Schindler 2020)). The entities in this step can be linked to respective global identifiers in Wikidata, GBIF, the Global Biodiversity Information Facility, or any other user-selected knowledge resource. In the next step, (subject, predicate, object) triples based on the extracted information from the previous steps will be created. After these processes, the generated sub-KG can be used directly. However, one can take further steps such as: Triple Augmentation (generate new triples and extra relations to ease KG completion), Schema Refinement (refine the schema, e.g., via logical reasoning for the KG completion and correctness), Quality Checking (check the quality of the generated sub-KG), and Query Building (create customized SPARQL queries for the generated KG). iKNOW will include a wide range of functionalities for creating, accessing, querying, visualizing, updating, reproducing, and tracking the provenance of KGs. The reproducibility of such a creation is essential to strengthening the establishment of open science practices in the biodiversity domain. Thus, all information regarding the user-selected tools with parameters and settings, along with the initial dataset and intermediate results, will be saved in every step of our platform. With the help of this, users can redo the previous steps. Moreover, this enables us to track the provenance of the created KG. The iKNOW project is a joint effort by computer scientists and domain experts from the German Centre for Integrative Biodiversity Research (iDiv). As a showcase, we aim to create a KG of plant-related data sources at iDiv. These include, among others: TRY (the plant trait database) (Kattge and DÍaz 2011), sPlot (the database about global patterns of taxonomic, functional, and phylogenetic diversity) (Bruelheide and Dengler 2019), and PhenObs (the dataset of the global network of botanical gardens monitoring the impacts of climate change on the phenology of herbaceous plant species) (Nordt and Hensen 2021), LCVP, the Leipzig Catalogue of Vascular Plants, (Freiberg and Winter 2020), and many others. The resulting KG will serve as a discovery tool for biodiversity data and provide a robust infrastructure for managing biodiversity knowledge. From the biodiversity research perspective, iKNOW will contribute to creating a dataset following the Linked Open Data principles by interlinking to cross-domain and specific-domain KGs. From the computer science perspective, iKNOW will contribute to developing tools for dynamic, low-effort creation of reproducible knowledge graphs.

Read full abstract

Resource Description Framework Research Articles

Related Topics

Articles published on Resource Description Framework

Defeasible RDFS via rational closure

Bridging the gap between the semantic web and big data: answering SPARQL queries over NoSQL databases

Graph-based data management system for efficient information storage, retrieval and processing

The OneGraph vision: Challenges of breaking the graph model lock-in1

Optimisation Techniques for Flexible SPARQL Queries

Fixing the inconsistencies of continuous changing operations in fuzzy spatiotemporal RDF graph

Astoundingly Smart System Furnishing Ranking of Big Data in Search Engines

Space/time-efficient RDF stores based on circular suffix sorting

Fuzzy Spatiotemporal Data Modeling and Operations in RDF

Linked Metadata for FAIR Digital Objects Carrying Computable Knowledge

Updating Linked Data practices for FAIR Digital Object principles

A method for semantic-based image retrieval using hierarchical clustering tree and graph

Semantic sensor network ontology based decision support system for forest fire management

FHIR-Ontop-OMOP: Building clinical knowledge graphs in FHIR RDF with the OMOP Common data Model

Semantic Protocol and Resource Description Framework Query Language: A Comprehensive Review

A Two-Phase Method for Optimization of the SPARQL Query

Exploiting lexical patterns for knowledge graph construction from unstructured text in Spanish

Reusable Ontology Modelling Patterns for Biodiversity Data with Reasonable Ontology Templates (OTTR)

IKNOW: A platform for knowledge graph construction for biodiversity

Modelling temporal data in knowledge graphs: a systematic review protocol.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Resource Description Framework Research Articles

Related Topics

Articles published on Resource Description Framework

Defeasible RDFS via rational closure

Bridging the gap between the semantic web and big data: answering SPARQL queries over NoSQL databases

Graph-based data management system for efficient information storage, retrieval and processing

The OneGraph vision: Challenges of breaking the graph model lock-in1

Optimisation Techniques for Flexible SPARQL Queries

Fixing the inconsistencies of continuous changing operations in fuzzy spatiotemporal RDF graph

Astoundingly Smart System Furnishing Ranking of Big Data in Search Engines

Space/time-efficient RDF stores based on circular suffix sorting

Fuzzy Spatiotemporal Data Modeling and Operations in RDF

Linked Metadata for FAIR Digital Objects Carrying Computable Knowledge

Updating Linked Data practices for FAIR Digital Object principles

A method for semantic-based image retrieval using hierarchical clustering tree and graph

Semantic sensor network ontology based decision support system for forest fire management

FHIR-Ontop-OMOP: Building clinical knowledge graphs in FHIR RDF with the OMOP Common data Model

Semantic Protocol and Resource Description Framework Query Language: A Comprehensive Review

A Two-Phase Method for Optimization of the SPARQL Query

Exploiting lexical patterns for knowledge graph construction from unstructured text in Spanish

Reusable Ontology Modelling Patterns for Biodiversity Data with Reasonable Ontology Templates (OTTR)

IKNOW: A platform for knowledge graph construction for biodiversity

Modelling temporal data in knowledge graphs: a systematic review protocol.