RETA: A Schema-Aware, End-to-End Solution for Instance Completion in Knowledge Graphs

Paolo Rosso,Natalia Ostapuk,Dingqi Yang,Philippe Cudré-Mauroux

doi:10.1145/3442381.3449883

Abstract

Knowledge Graph (KG) completion has been widely studied to tackle the incompleteness issue (i.e., missing facts) in modern KGs. A fact in a KG is represented as a triplet (h, r, t) linking two entities h and t via a relation r. Existing work mostly consider link prediction to solve this problem, i.e., given two elements of a triplet predicting the missing one, such as (h, r, ?). This task has, however, a strong assumption on the two given elements in a triplet, which have to be correlated, resulting otherwise in meaningless predictions, such as (Marie Curie, headquarters location, ?). In addition, the KG completion problem has also been formulated as a relation prediction task, i.e., when predicting relations r for a given entity h. Without predicting t, this task is however a step away from the ultimate goal of KG completion. Against this background, this paper studies an instance completion task suggesting r-t pairs for a given h, i.e., (h, ?, ?). We propose an end-to-end solution called RETA (as it suggests the Relation and Tail for a given head entity) consisting of two components: a RETA-Filter and RETA-Grader. More precisely, our RETA-Filter first generates candidate r-t pairs for a given h by extracting and leveraging the schema of a KG; our RETA-Grader then evaluates and ranks the candidate r-t pairs considering the plausibility of both the candidate triplet and its corresponding schema using a newly-designed KG embedding model. We evaluate our methods against a sizable collection of state-of-the-art techniques on three real-world KG datasets. Results show that our RETA-Filter generates of high-quality candidate r-t pairs, outperforming the best baseline techniques while reducing by 10.61%-84.75% the candidate size under the same candidate quality guarantees. Moreover, our RETA-Grader also significantly outperforms state-of-the-art link prediction techniques on the instance completion task by 16.25%-65.92% across different datasets.

Highlights

Knowledge Graphs (KGs), such as Freebase [5], Wikidata1 or Google’s Knowledge Graph2, have become a key resource powering a broad spectrum of Web applications, such as semantic search [48], questionanswering [51], or recommender systems [54]
To implement our instance completion task, for a test h, we first generate a set of candidate r -t pairs, and score and rank them
We first take the top N relations generated by a relation prediction technique and use one tail candidate refinement technique to generate a set of candidate r -t pairs

Summary

INTRODUCTION

Knowledge Graphs (KGs), such as Freebase [5], Wikidata or Google’s Knowledge Graph, have become a key resource powering a broad spectrum of Web applications, such as semantic search [48], questionanswering [51], or recommender systems [54]. With a small set of predicted relations, the number of candidate r -t pairs fed to the link prediction technique can be significantly reduced Such an approach still shows subpar performance, as it fails to fully consider the triplewise correlation of the three elements in a triplet, in particular the schema information encoded in the entity-typed triplet (h_type, r, t_type). If we have the schema information represented as entity-typed triplets (h_type, r, t_type) —(enterprise, headquarters location, city) and (enterprise, industry, economic branch), we could filter out such noisy r -t pairs that do not match the schema of the KG (to the given h) Against this background and to effectively solve our instance completion problem over KGs (h, ?, ?), we propose an end-to-end solution fully leveraging schema information encoded in triplets. Our RETA-Grader significantly outperforms state-of-the-art link prediction techniques on the instance completion task by 16.25%-65.92% across different datasets

RELATED WORK

Link Prediction Task

Relation Prediction Task

Instance Completion Task

SCHEMA-AWARE INSTANCE COMPLETION

RETA-Filter

RETA-Grader

Experimental Setup

Performance on Filtering r -t Pairs

Performance on Ranking r -t Pairs

Method

Parameter Sensitivity Study

CONCLUSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RETA: A Schema-Aware, End-to-End Solution for Instance Completion in Knowledge Graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Apr 19, 2021
Citations: 8	License type: cc-by

Similar Papers

Fine-Grained Evaluation of Knowledge Graph Embedding Models in Downstream Tasks
Yuxin Zhang ... Bohan Li
-
Yuxin Zhang, et. al.Yuxin Zhang ... Bohan Li
01 Jan 2020
01 Jan 2020

Fine-Grained Evaluation of Knowledge Graph Embedding Model in Knowledge Enhancement Downstream Tasks
Yuxin Zhang ... Han Yang
Big Data Research | VOL. 25
Yuxin Zhang, et. al.Yuxin Zhang ... Han Yang
02 Mar 2021
Big Data Research | VOL. 25

Rule-based data augmentation for knowledge graph embedding
Guangyao Li ... Wei Hu
AI Open | VOL. 2
Guangyao Li, et. al.Guangyao Li ... Wei Hu
01 Jan 2020
AI Open | VOL. 2

Sequence-to-Sequence Knowledge Graph Completion and Question Answering
Apoorv Saxena ... Adrian Kochsiek
-
Apoorv Saxena, et. al.Apoorv Saxena ... Adrian Kochsiek
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RETA: A Schema-Aware, End-to-End Solution for Instance Completion in Knowledge Graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers