The aim of Medical Knowledge Graph Completion is to automatically predict one of the three parts (head entity, relationship, or tail entity) of RDF triples from medical data, mainly text data. Since their introduction, pretrained language models such as Word2vec, BERT, and XLNet have become a popular means of completing Medical Knowledge Graphs. Existing work focuses mainly on relationship completion and rarely addresses the completion of entities and whole triples. In this paper, a framework for predicting RDF triples for Medical Knowledge Graphs based on word embeddings (named PTMKG-WE) is proposed, specifically for the completion of entities and triples. The framework first formalizes existing samples of a given relationship from the Medical Knowledge Graph as prior knowledge. Second, it trains word embeddings on large-scale medical data, guided by this prior knowledge, through Word2vec. Third, it acquires candidate triples from the word embeddings by analogy with the existing samples. Within this framework, the paper proposes two strategies to improve the relation features. The first refines the relational semantics by clustering the existing triple samples. The second embeds the relationship more accurately by averaging over the existing samples. These two strategies can be applied separately (called PTMKG-WE-C and PTMKG-WE-M, respectively) or superimposed (called PTMKG-WE-C-M). Finally, in the current study, PubMed data and the National Drug File-Reference Terminology (NDF-RT) were collected, and a series of experiments was conducted. The experimental results show that the proposed framework and the two improvement strategies can predict new triples for Medical Knowledge Graphs when the medical data are sufficiently abundant and the Knowledge Graph provides appropriate prior knowledge.
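The analogy-based prediction step and the mean-based relation strategy can be illustrated with a minimal sketch. The toy embedding table, the entity names, and the function names below are hypothetical stand-ins (the paper trains real Word2vec vectors on PubMed text); the sketch only shows the mechanism: average the tail-minus-head offsets of known sample triples into a relation vector, then rank candidate tails by cosine similarity to head + relation vector.

```python
import numpy as np

# Toy embedding table standing in for Word2vec vectors trained on
# medical text; the words and values are illustrative only.
emb = {
    "aspirin":   np.array([1.0, 0.0, 0.2]),
    "headache":  np.array([1.0, 1.0, 0.2]),
    "ibuprofen": np.array([0.2, 0.1, 1.0]),
    "pain":      np.array([0.2, 1.1, 1.0]),
    "insulin":   np.array([0.9, 0.1, 0.9]),
    "diabetes":  np.array([0.9, 1.1, 0.9]),
}

def relation_vector(samples):
    """Mean of (tail - head) offsets over known sample triples
    (the averaging idea behind the 'M' strategy)."""
    return np.mean([emb[t] - emb[h] for h, t in samples], axis=0)

def predict_tail(head, rel_vec, exclude):
    """Nearest neighbour to head + rel_vec by cosine similarity,
    skipping the head itself and already-known tails."""
    target = emb[head] + rel_vec
    best, best_sim = None, -1.0
    for word, vec in emb.items():
        if word == head or word in exclude:
            continue
        sim = float(target @ vec /
                    (np.linalg.norm(target) * np.linalg.norm(vec)))
        if sim > best_sim:
            best, best_sim = word, sim
    return best

# Known samples of a "may_treat"-style relation act as prior knowledge.
samples = [("aspirin", "headache"), ("ibuprofen", "pain")]
rel = relation_vector(samples)
print(predict_tail("insulin", rel, exclude={t for _, t in samples}))
# -> diabetes
```

The clustering ('C') strategy would first group the sample triples and compute one such relation vector per cluster, so that a polysemous relation is represented by several tighter offsets instead of one global mean.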
The two strategies designed to improve the relation features significantly raise precision, and the effect is even more pronounced when they are superimposed. Another conclusion is that, under the same parameter settings, the semantic precision of the word embeddings can be improved by extending the breadth and depth of the data, which in most cases further improves the precision of the proposed prediction framework. Thus, collecting and training on big medical data is a viable way to learn more useful knowledge.