Abstract

Transfer reinforcement learning (RL) has recently received increasing attention as a way to improve the learning performance of RL agents in target Markov decision processes (MDPs) by reusing knowledge learned in source MDPs. However, improving the transfer capability and interpretability of RL algorithms remains an open and challenging problem. In this paper, we propose a novel transfer reinforcement learning approach via meta-knowledge extraction using auto-pruned decision trees. In the source MDPs, pre-trained policies are first learned with RL algorithms that use general function approximators. Then, a meta-knowledge extraction algorithm is designed around an auto-pruned decision tree model: the meta-knowledge is learned by re-training the auto-pruned decision tree on data samples generated by the pre-trained policies. The state space covered by the meta-knowledge is determined by estimating the uncertainty of state–action pairs in the pre-trained policies from the entropy of the leaf nodes. In the target MDPs, a hybrid policy is generated by integrating the meta-knowledge with the policies learned on the target MDPs, choosing between them according to whether the current state lies in the meta-knowledge state set. Based on the proposed transfer RL approach, two meta-knowledge-based transfer reinforcement learning (MKRL) algorithms are developed for MDPs with discrete and continuous action spaces, respectively. Experimental results on several benchmark tasks show that the MKRL algorithms outperform the baselines in terms of learning efficiency and interpretability on target MDPs across generic cases of task similarity.
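
The entropy-gated hybrid policy described in the abstract can be sketched in a few lines. The snippet below is a minimal illustration, not the authors' implementation: it assumes the pre-trained source policy has been distilled into a scikit-learn decision tree, approximates the auto-pruning with `max_leaf_nodes`, and the function names and the `entropy_threshold` value are hypothetical.

```python
# Minimal sketch of entropy-gated meta-knowledge transfer (illustrative, not the paper's code).
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def fit_meta_tree(states, actions, max_leaf_nodes=32):
    """Distill a pre-trained source policy into a pruned decision tree (the meta-knowledge)."""
    tree = DecisionTreeClassifier(max_leaf_nodes=max_leaf_nodes)
    tree.fit(states, actions)
    return tree

def leaf_entropies(tree):
    """Entropy of the action distribution stored at each tree node (leaves included)."""
    counts = tree.tree_.value[:, 0, :]                       # per-node class statistics
    probs = counts / np.clip(counts.sum(axis=1, keepdims=True), 1e-12, None)
    with np.errstate(divide="ignore", invalid="ignore"):
        ent = -np.where(probs > 0, probs * np.log(probs), 0.0).sum(axis=1)
    return ent

def hybrid_action(state, tree, target_policy, entropy_threshold=0.3):
    """Use the meta-knowledge action when its leaf is confident, else the target-MDP policy."""
    x = np.asarray(state).reshape(1, -1)
    leaf = tree.apply(x)[0]                                   # leaf node reached by this state
    if leaf_entropies(tree)[leaf] < entropy_threshold:        # state is in the meta-knowledge set
        return tree.predict(x)[0]
    return target_policy(state)                               # fall back to the policy learned on the target MDP
```

In this reading, the entropy threshold plays the role of the uncertainty estimate described in the abstract; states routed to low-entropy leaves form the meta-knowledge state set, and all other states are handled by the policy being learned on the target MDP.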
