Scalable lifelong reinforcement learning

Yusen Zhan,Haitham Bou Ammar,Matthew E Taylor

doi:10.1016/j.patcog.2017.07.031

Yusen Zhan, Haitham Bou Ammar + Show 1 more

Open Access

https://doi.org/10.1016/j.patcog.2017.07.031

Copy DOI

Journal: Pattern Recognition	Publication Date: Jul 29, 2017
Citations: 19	License type: publisher-specific-oa

Affiliation: Washington State University

Abstract

Lifelong reinforcement learning provides a successful framework for agents to learn multiple consecutive tasks sequentially. Current methods, however, suffer from scalability issues when the agent has to solve a large number of tasks. In this paper, we remedy the above drawbacks and propose a novel scalable technique for lifelong reinforcement learning. We derive an algorithm which assumes the availability of multiple processing units and computes shared repositories and local policies using only local information exchange. We then show an improvement to reach a linear convergence rate compared to current lifelong policy search methods. Finally, we evaluate our technique on a set of benchmark dynamical systems and demonstrate learning speed-ups and reduced running times.

Full Text