Currently, morphological data and metadata are still mostly published as unstructured free texts, which lack semantic transparency, cannot be parsed by computers, and do not comply with the FAIR (Findable, Accessible, Interoperable, Reusable; Wilkinson et al. (2016) data principles, thus hampering their reuse by non-experts and their integration across many fields in the life sciences. With an ever-increasing amount of available ontologies and the development of adequate semantic technology, however, a solution to this problem becomes available. Instead of free text descriptions, morphological data and metadata can be recorded, stored, and communicated through the Web in the form of Resource Description Framework (RDF) triple statements that use the ‘Subject – Property – Object’ syntax of RDF and URIs of ontology classes and properties as well as URIs for individual entities as terminology. Since a given URI can take the ‘Subject’ position in one and the ‘Object’ position in another RDF statement, several triples can be linked to form a highly formalized and structured directed graph (semantic graph). After introducing an instance-based approach of recording morphological descriptions and their accompanying metadata as semantic knowledge graphs (i.e. Anatomy Knowledge Graphs), we propose a knowledge graph template pattern for each type of anatomical observation and a pattern for documenting metadata. The use of template patterns for knowledge graphs provides Interoperability and Reusability of comparable anatomical observations and of their accompanying metadata and a means to meaningfully visualize information contained in semantic graphs in a user-friendly HTML representation. Stored in a tuple store, Anatomy Knowledge Graphs become Findable and Accessible through the store’s SPARQL endpoint. As a consequence, anatomy data and metadata documented as Anatomy Knowledge Graphs in a tuple store are FAIR. Finally, we suggest a general scheme of how to efficiently organize Anatomy Knowledge Graphs in a tuple store framework based on instances of named graphs, with each individual named graph instantiating an ontology class that relates to a particular type of observation (e.g., weight measurement named graph class). A named graph is a fourth element in an RDF statement (‘Subject – Property – Object – Named Graph’), turning the triple into a quadruple. All RDF statements that share the same URI in the ‘Named Graph’ position belong to the same named graph. The use of named graph resources allows meaningful fragmentation of the contents of an Anatomy Knowledge Graph (Fig. 1), which in turn enables subsequent specification of all kinds of data views for managing and accessing morphological data and metadata. This scheme has been implemented in the description module of the prototype for semantic Morph∙D∙Base.
Read full abstract