Knowledge graph extension with a pre-trained language model via unified learning method

Bonggeun Choi,Youngjoong Ko

doi:10.1016/j.knosys.2022.110245

Abstract

Knowledge graphs (KGs) are collections of real-world knowledge that is represented by a structured form of triples. Since they are manually built in their nascent stage, there is a common problem that some links (triples) are missing. Knowledge graph completion (KGC) aims to find those missing links and thereby complete the KGs. However, as knowledge increases through diverse sources, new entities have explosively emerged and they are needed to be connected to existing KGs. Thus, open-world KGC is targeted on extending KGs to those new entities. Dealing with those new entities is challenging because they do not have any connection with entities in the existing KGs. One way to handle the new ones is to embed them with their textual descriptions with pre-trained word embeddings and score them in the graph-vector space with the existing typical KGC models. These models have resulted in meaningful results but there is still a lack of studies on utilizing the latest neural networks, such as pre-trained language models which are known to be better at capturing contexts than pre-trained word embeddings. This paper proposes a novel model that effectively connects new entities and existing KGs through a pre-trained language model. To effectively handle the problem, we utilize two learning methods; one is the classification method of the masked language model (MLM) that predicts a word among a huge vocabulary set with a given context, and the other is multi-task learning based on the Multi-Task for Deep Neural Networks (MT-DNN). Based on the methods, the model first generates an embedding of a new entity using its textual description and then uses the embedding to find one of the existing entities from a KG where the new entity can be connected. The experimental results on three benchmark datasets, DBPedia50k, FB15k-237-OWE, and FB20k, show that the proposed model improves performances by 9.2%p, 4.4%p, and 11.1%p, respectively, and achieves new state-of-the-art performance for all datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Knowledge graph extension with a pre-trained language model via unified learning method

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems

Lead the way for us

Journal: Knowledge-Based Systems	Publication Date: Jan 7, 2023
Citations: 4

Similar Papers

MEM-KGC: Masked Entity Model for Knowledge Graph Completion With Pre-Trained Language Model
Bonggeun Choi ... Youngjoong Ko
IEEE Access | VOL. 9
Bonggeun Choi, et. al.Bonggeun Choi ... Youngjoong Ko
01 Jan 2020
IEEE Access | VOL. 9

Simple Knowledge Graph Completion Model Based on Differential Negative Sampling and Prompt Learning
Li Duan ... Qiao Sun
Information | VOL. 14
Li Duan, et. al.Li Duan ... Qiao Sun
09 Aug 2023
Information | VOL. 14

Multi-Task Learning for Knowledge Graph Completion with Pre-trained Language Models

-

25 Nov 2020
25 Nov 2020

Multi-Task Learning for Knowledge Graph Completion with Pre-trained Language Models
Bosung Kim ... Jungyun Seo
-
Bosung Kim, et. al.Bosung Kim ... Jungyun Seo
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Knowledge graph extension with a pre-trained language model via unified learning method

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems