In the process of building domain knowledge graph, the result of relationship extraction between entities is an important guarantee of the quality of the graph. Therefore, we propose a clustering method based on reinforcement learning for remote supervised relation extraction. For the relationship extraction of accident information in the aviation domain mapping, a clustering method combining local dense and global dissimilarity is proposed in combination with remote supervision, which can obtain a large amount of low-noise labeled data and reduce part of the wrong labeling and missing labeling due to the strong specialization in the aviation domain; meanwhile, reinforcement learning is introduced to denoise the negative instance noise in the positive sample data; Finally, we propose a two-attention segmentation (DAPCNN) relationship extraction model to mine deep semantic sentences. The experimental results show that in the civil aviation relationship extraction text constructed in this paper, the Micro_R, Micro_P and Micro_F1 values of the proposed relationship extraction method reach 83.41 %, 84.16 % and 83.96 %. In the open relationship extraction dataset DuIE, The Micro_R, Micro_P and Micro_F1 of the proposed method are up to 83.41 %, 93.58 % and 94.02 % respectively. Compared with the current advanced multi-instance and multi-label model, the proposed method can more accurately extract the relationship between aviation accident entities. At the same time, the performance of the open data set is also good, and has a certain universality.
Read full abstract