Abstract

Multi-hop knowledge graph question answering (KGQA) aims to pinpoint answer entities by reasoning across multiple triples in knowledge graphs (KGs). To enhance model interpretability, reinforcement learning (RL) based methods are proposed. However, RL-based solutions suffer from two major issues. The first is aimless exploration due to the limited observation information. The second is delayed and sparse rewards caused by weak supervision. To bridge the gap, we propose an interpretable model based on RL for multi-hop KGQA with weak supervision, which we refer to as Anticipation Reasoning Network (ARN) that incorporates KG embeddings as anticipation information into RL framework to avoid aimlessly traversing in KG. ARN achieves the goal via three steps. (1) KG embedding models represent the entities and relations as low-dimensional vectors. (2) We propose a knowledge embedding based question answering (KEQA) framework to measure the plausibility of the “triple” <topic entity, question, answer entity> in embedding space. (3) A policy-guided agent is designed to perform effective path reasoning by selecting the most promising action at each step. Moreover, we utilize reward shaping strategy and actor-critic optimization to alleviate the problem of low-quality rewards. Extensive experiments well demonstrate the effectiveness of our method footnoteFor reproducibility, we publicly release the code athttps://github.com/iDylanCui/ARN.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call