Abstract

Relation extraction (RE) aims to mine semantic relations between entity pairs from plain texts, which plays an important role in various natural language processing (NLP) tasks. However, the existing methods in distant supervision (DS) are sensitive to bags and fail to handle sentence-level relation prediction. In particular, few methods focus on the sentence-level label denoising. In this paper, the sentence-level label denoising model based on reinforcement learning (RL) and the express-only-one assumption is proposed for distantly supervised RE. First, unlike removing the noisy sentences in previous studies, this paper designs Deep Q Network (DQN), a value-based RL algorithm, as a label denoiser to select the most reliable labels from the multiple relations that sentences are labeled. Second, the relation extractor applies the typical neural network model to predict relations between the data before and after the label denoiser cleans. The rewards in label denoiser are measured by the differences of prediction scores. Finally, the two modules between label denoiser and relation extractor are trained jointly to obtain correct labels and improve the extraction performance at the sentence level. The experimental results show that the proposed denoiser can deal with the noise labels of data effectively and the proposed model outperforms previous state-of-the-art baselines on both the Riedel dataset and human-annotated dataset.

Highlights

  • Relation extraction (RE) aims to predict the semantic relations between two entities from given texts

  • The experimental results show that the Deep Q Network (DQN)-based model is effective to reduce the noise labels and the proposed method outperforms various state-of-theart baselines on both the Riedel dataset and humanannotated dataset

  • A new sentence-level label denoising model based on reinforcement learning (RL) is proposed for distantly supervised relation extraction (RE)

Read more

Summary

INTRODUCTION

Relation extraction (RE) aims to predict the semantic relations between two entities from given texts. To handle the first problem, after obtained the data with the correct labels, the relation extractor predicts sentence-level relation probability. The RL-based bag-level label denoising method is proposed by Sun et al [23] to correct the noisy labels by treating a new label as the gold labels They use a policy network to correct wrong labels and extraction network to provide rewards for the corrected labels from the policy network. According to the express-only-one assumption, the label denoiser uses a value-based RL algorithm to select a reliable label for each sentence containing multiple wrong labels. A. DQN-BASED RELATION DENOISER Distantly supervised RE predicts the relations of entity pairs in an automatically generated training data.

4) OBJECTIVE FUNCTION
SENTENCE-LEVEL RELATION EXTRACTOR
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.