Guiding Robot Exploration in Reinforcement Learning via Automated Planning

Yohei Hayamizu,Shiqi Zhang,Saeid Amiri,Kishan Chandan,Keiki Takadama

doi:10.1609/icaps.v31i1.16011

Abstract

Reinforcement learning (RL) enables an agent to learn from trial-and-error experiences toward achieving long-term goals; automated planning aims to compute plans for accomplishing tasks using action knowledge. Despite their shared goal of completing complex tasks, the development of RL and automated planning has been largely isolated due to their different computational modalities. Focusing on improving RL agents' learning efficiency, we develop Guided Dyna-Q (GDQ) to enable RL agents to reason with action knowledge to avoid exploring less-relevant states. The action knowledge is used for generating artificial experiences from an optimistic simulation. GDQ has been evaluated in simulation and using a mobile robot conducting navigation tasks in a multi-room office environment. Compared with competitive baselines, GDQ significantly reduces the effort in exploration while improving the quality of learned policies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Guiding Robot Exploration in Reinforcement Learning via Automated Planning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling

Lead the way for us

Journal: Proceedings of the International Conference on Automated Planning and Scheduling	Publication Date: May 17, 2021
Citations: 7

Similar Papers

Reward Space Noise for Exploration in Deep Reinforcement Learning
Chuxiong Sun ... Xiaohui Hu
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 35
Chuxiong Sun, et. al.Chuxiong Sun ... Xiaohui Hu
21 May 2021
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 35

Probably Approximately Correct (PAC) exploration in reinforcement learning

-

01 Jan 2007
01 Jan 2007

Strategic Exploration in Reinforcement Learning - New Algorithms and Learning Guarantees

-

24 Feb 2020
24 Feb 2020

Efficient Exploration in Reinforcement Learning

-

07 Feb 2012
07 Feb 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Guiding Robot Exploration in Reinforcement Learning via Automated Planning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling