Medical Knowledge Graph Completion Based on Word Embeddings

Mingxia Gao,Furong Chen,Jianguo Lu

doi:10.3390/info13040205

Mingxia Gao, Furong Chen + Show 1 more

Open Access

https://doi.org/10.3390/info13040205

Copy DOI

Abstract

The aim of Medical Knowledge Graph Completion is to automatically predict one of three parts (head entity, relationship, and tail entity) in RDF triples from medical data, mainly text data. Following their introduction, the use of pretrained language models, such as Word2vec, BERT, and XLNET, to complete Medical Knowledge Graphs has become a popular research topic. The existing work focuses mainly on relationship completion and has rarely solved entities and related triples. In this paper, a framework to predict RDF triples for Medical Knowledge Graphs based on word embeddings (named PTMKG-WE) is proposed, for the specific use for the completion of entities and triples. The framework first formalizes existing samples for a given relationship from the Medical Knowledge Graph as prior knowledge. Second, it trains word embeddings from big medical data according to prior knowledge through Word2vec. Third, it can acquire candidate triples from word embeddings based on analogies from existing samples. In this framework, the paper proposes two strategies to improve the relation features. One is used to refine the relational semantics by clustering existing triple samples. Another is used to accurately embed the expression of the relationship through means of existing samples. These two strategies can be used separately (called PTMKG-WE-C and PTMKG-WE-M, respectively), and can also be superimposed (called PTMKG-WE-C-M) in the framework. Finally, in the current study, PubMed data and the National Drug File-Reference Terminology (NDF-RT) were collected, and a series of experiments was conducted. The experimental results show that the framework proposed in this paper and the two improvement strategies can be used to predict new triples for Medical Knowledge Graphs, when medical data are sufficiently abundant and the Knowledge Graph has appropriate prior knowledge. The two strategies designed to improve the relation features have a significant effect on the lifting precision, and the superposition effect becomes more obvious. Another conclusion is that, under the same parameter setting, the semantic precision of word embedding can be improved by extending the breadth and depth of data, and the precision of the prediction framework in this paper can be further improved in most cases. Thus, collecting and training big medical data is a viable method to learn more useful knowledge.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information	Publication Date: Apr 18, 2022
Citations: 8	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Medical Knowledge Graph Completion Based on Word Embeddings

Abstract

Talk to us

Similar Papers

More From: Information

Lead the way for us

Similar Papers

A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development.
Linfeng Li ... Yuting Liu
JMIR Medical Informatics | VOL. 8
Linfeng Li, et. al.Linfeng Li ... Yuting Liu
21 May 2020
JMIR Medical Informatics | VOL. 8

Learning Better Word Embedding by Asymmetric Low-Rank Projection of Knowledge Graph
Fei Tian ... Bin Gao
Journal of Computer Science and Technology | VOL. 31
Fei Tian, et. al.Fei Tian ... Bin Gao
01 May 2016
Journal of Computer Science and Technology | VOL. 31

Knowledge and data-driven prediction of organ failure in critical care patients.
Xinyu Ma ... Wen Ouyang
Health information science and systems | VOL. 11
Xinyu Ma, et. al.Xinyu Ma ... Wen Ouyang
23 Jan 2023
Health information science and systems | VOL. 11

Knowledge graph embedding via multiplicative interaction
Zichao Huang ... Jian Yin
-
Zichao Huang, et. al.Zichao Huang ... Jian Yin
09 Mar 2018
09 Mar 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Medical Knowledge Graph Completion Based on Word Embeddings

Abstract

Talk to us

Similar Papers

More From: Information