Zero-shot policy generation in lifelong reinforcement learning

Yi-Ming Qian,Fang-Zhou Xiong,Zhi-Yong Liu

doi:10.1016/j.neucom.2021.02.058

Abstract

Lifelong reinforcement learning (LRL) is an important approach to achieve continual lifelong learning of multiple reinforcement learning tasks. The two major methods used in LRL are task decomposition and policy knowledge extraction. Policy knowledge extraction method in LRL can share knowledge for tasks in different task domains and for tasks in the same task domain with different system environmental coefficients. However, the generalization ability of policy knowledge extraction method is limited on learned tasks rather than learned task domains. In this paper, we propose a cross-domain lifelong reinforcement learning algorithm with zero-shot policy generation ability (CDLRL-ZPG) to improve generalization ability of policy knowledge extraction method from learned tasks to learned task domains. In experiments, we evaluated CDLRL-ZPG performance on four task domains. And our results show that the proposed algorithm can directly generate satisfactory results without needing a trial and error learning process to achieve zero-shot learning in general.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Zero-shot policy generation in lifelong reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Mar 10, 2021
Citations: 5

Similar Papers

Intra-domain Knowledge Generalization in Cross-Domain Lifelong Reinforcement Learning
Yiming Qian ... Fangzhou Xiong
-
Yiming Qian, et. al.Yiming Qian ... Fangzhou Xiong
01 Jan 2020
01 Jan 2020

Lifelong reinforcement learning with temporal logic formulas and reward machines
Xuejing Zheng ... Minjie Zhang
Knowledge-Based Systems | VOL. 257
Xuejing Zheng, et. al.Xuejing Zheng ... Minjie Zhang
11 Aug 2022
Knowledge-Based Systems | VOL. 257

System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games
Indranil Sur ... Zachary Daniels
-
Indranil Sur, et. al.Indranil Sur ... Zachary Daniels
12 Oct 2022
12 Oct 2022

Image Morphing: Transfer Learning between Tasks That Have Multiple Outputs
Daniel L Silver ... Liangliang Tu
-
Daniel L Silver, et. al.Daniel L Silver ... Liangliang Tu
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Zero-shot policy generation in lifelong reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Neurocomputing