Abstract

Question-answering systems based on knowledge graphs are extremely challenging tasks in the field of natural language processing. Most of the existing Chinese Knowledge Base Question Answering(KBQA) can only return the knowledge stored in the knowledge base by extractive methods. Nevertheless, this processing does not conform to the reading habits and cannot solve the Out-of-vocabulary(OOV) problem. In this paper, a new generative question answering method based on knowledge graph is proposed, including three parts of knowledge vocabulary construction, data pre-processing, and answer generation. In the word list construction, BiLSTM-CRF is used to identify the entity in the source text, finding the triples contained in the entity, counting the word frequency, and constructing it. In the part of data pre-processing, a pre-trained language model BERT combining word frequency semantic features is adopted to obtain word vectors. In the answer generation part, one combination of a vocabulary constructed by the knowledge graph and a pointer generator network(PGN) is proposed to point to the corresponding entity for generating answer. The experimental results show that the proposed method can achieve superior performance on WebQA datasets than other methods.

Highlights

  • Natural language processing is a hot research field in artificial intelligence

  • Methods based on semantic analysis often require vocabulary mapping and construction of a syntax tree to transform natural language into a semantic representation that can be understood by machine language, so as to perform reasoning or query to get the correct answer

  • The question sentence is first passed into the BERT word frequency feature, and entered into the Encoder-Decoder model based on Attention

Read more

Summary

Introduction

Natural language processing is a hot research field in artificial intelligence. A questionanswering system, as a sub-field of natural language processing, studies how to interact with machines naturally. Wei et al [15] proposed a new method based on multi-instance learning to solve the problem of noisy answers by exploring the consistency between the answers to the same question in the training end-to-end KBQA model. These methods can only extract knowledge from existing data and return them as answers, and the returned results are simple. To solve these problems, this paper proposes to use the end-to-end generative question answering model for knowledge graph question answering.

Related Work
Construction of Generative
Knowledge Vocabulary Construction
Vocabulary Construction
Answer
Pointer-Generator Network Model
Datasets
Parameter Setting
Entity Recognition Module
Method
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.