Abstract
Distant supervision is an effective method for generating large-scale labeled data for relation extraction. It assumes that if a pair of entities appears in some relation in a Knowledge Graph (KG), then all sentences containing those entities in a large unlabeled corpus can be labeled with that relation to train a relation classifier. However, when the pair of entities has multiple relationships in the KG, this assumption produces noisy relation labels. This paper proposes a label-free distant supervision method, which makes no use of the relation labels derived under this inadequate assumption, but instead uses only the prior knowledge derived from the KG to supervise the learning of the classifier directly and softly. Specifically, we make use of the type information and the translation law derived from a typical KG embedding model to learn embeddings for certain sentence patterns. Since the supervision signal is determined only by the two aligned entities, neither hard relation labels nor an extra noise-reduction model for the bag of sentences is needed. The experiments show that the approach performs well on a current distant supervision dataset.
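To make the mechanism concrete, below is a minimal sketch of this kind of label-free objective, assuming TransE-style KG embeddings in which t - h approximates the relation vector between an aligned entity pair (h, t). All names (PatternEncoder, entity_emb, the sizes) are hypothetical illustrations, not the paper's published code: the point is only that the encoder's output for a sentence is pulled toward the entity-pair difference vector, so no hard relation label ever enters the loss.

    # Hypothetical sketch: label-free supervision via a TransE-style
    # translation law. The encoder output for a sentence pattern is
    # trained to match t - h, the difference of the aligned entity
    # embeddings, instead of a (possibly noisy) hard relation label.
    import torch
    import torch.nn as nn

    class PatternEncoder(nn.Module):
        """Toy stand-in for a sentence encoder (a CNN in practice)."""
        def __init__(self, vocab_size, dim):
            super().__init__()
            self.emb = nn.EmbeddingBag(vocab_size, dim)  # mean-of-words encoder

        def forward(self, token_ids):
            return self.emb(token_ids)

    dim, vocab_size, num_entities = 50, 1000, 200
    encoder = PatternEncoder(vocab_size, dim)
    entity_emb = nn.Embedding(num_entities, dim)  # pretrained KG embeddings in practice

    opt = torch.optim.Adam(encoder.parameters(), lr=1e-3)

    # One training step: a batch of sentences plus their aligned (head, tail) pairs.
    tokens = torch.randint(0, vocab_size, (4, 8))  # 4 sentences, 8 tokens each
    head = torch.randint(0, num_entities, (4,))
    tail = torch.randint(0, num_entities, (4,))

    s = encoder(tokens)                            # sentence pattern embedding
    target = entity_emb(tail) - entity_emb(head)   # translation law: r ≈ t - h
    opt.zero_grad()
    loss = ((s - target.detach()) ** 2).sum(dim=1).mean()  # KG embeddings stay frozen
    loss.backward()
    opt.step()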
Highlights
Distant supervision was first proposed by Mintz et al. (2009), who used seed triples in Freebase instead of manual annotation to supervise text
It indicates that our label-free supervision, with prior knowledge introduced by the translation laws and entity types in the Knowledge Graph (KG), is effective in avoiding noise, which credibly answers the second question we raised in Section 4
We argue that the noisy-label problem in distant supervision is mainly caused by the incomplete use of KG information
Summary
Distant supervision was first proposed by Mintz et al. (2009), who used seed triples in Freebase instead of manual annotation to supervise text. It marks a sentence with relation r if (h, r, t) can be found in a known KG, where (h, t) is the pair of entities contained in the sentence. One line of work, Multi-Instance Learning (MIL), divides the sentences into bags by (h, t) and tries to select well-labeled sentences from each bag (Zeng et al., 2015) or to reduce the weight of mislabeled data (Lin et al., 2016). In our approach, since both (Turkey, Ankara) and (Mexico, Guadalajara) are used to supervise the learning of the encoder for the pattern "in A, B", the embedding of the sentence pattern is pulled closer to the correct relation "/location/location/contains" than to the wrong relation "/location/country/capital". In this way, we no longer need to label the sentences with hard relation labels. In the experiments, we show that the label-free approach performs well on a current distant supervision dataset
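A toy numeric illustration of why this averaging avoids the noisy label (all vectors invented for exposition, not taken from the paper): (Turkey, Ankara) satisfies both relations, while (Mexico, Guadalajara) satisfies only "contains", so the supervision target shared by the pattern "in A, B" leans toward the relation the pairs have in common.

    # Invented 2-d vectors showing how supervision from multiple entity
    # pairs pulls the pattern "in A, B" toward the shared relation.
    import numpy as np

    contains = np.array([1.0, 0.0])  # /location/location/contains
    capital  = np.array([0.0, 1.0])  # /location/country/capital

    # t - h for each aligned pair: (Turkey, Ankara) holds both relations,
    # so its difference vector sits between them; (Mexico, Guadalajara)
    # holds only "contains".
    turkey_ankara      = contains + capital
    mexico_guadalajara = contains

    target = (turkey_ankara + mexico_guadalajara) / 2  # averaged supervision
    for name, r in [("contains", contains), ("capital", capital)]:
        cos = r @ target / (np.linalg.norm(r) * np.linalg.norm(target))
        print(name, round(float(cos), 3))  # contains: 0.894, capital: 0.447

The "contains" direction gets the higher cosine similarity, so the pattern embedding converges toward /location/location/contains rather than the capital relation, with no per-sentence label or bag-level denoising model involved.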