Generalized model learning for Reinforcement Learning on a humanoid robot

Todd Hester, Peter Stone, Michael Quinlan

doi:10.1109/robot.2010.5509181

Abstract

Reinforcement learning (RL) algorithms have long been promising methods for enabling an autonomous robot to improve its behavior on sequential decision-making tasks. The obvious enticement is that the robot should be able to improve its own behavior without the need for detailed step-by-step programming. However, for RL to reach its full potential, the algorithms must be sample efficient: they must learn competent behavior from very few real-world trials. From this perspective, model-based methods, which use experiential data more efficiently than model-free approaches, are appealing. But they often require exhaustive exploration to learn an accurate model of the domain. In this paper, we present an algorithm, Reinforcement Learning with Decision Trees (RL-DT), that uses decision trees to learn the model by generalizing the relative effect of actions across states. The agent explores the environment until it believes it has a reasonable policy. The combination of the learning approach with the targeted exploration policy enables fast learning of the model. We compare RL-DT against standard model-free and model-based learning methods, and demonstrate its effectiveness on an Aldebaran Nao humanoid robot scoring goals in a penalty kick scenario.
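To illustrate what "generalizing the relative effect of actions across states" can mean in practice, here is a minimal sketch, not the authors' implementation. It assumes a hypothetical grid domain with state (x, y) and four movement actions, and a deliberately simple entropy-based tree with equality splits; all names (`ACTIONS`, `build_tree`, `predict_next`) are illustrative. One tree per state feature predicts the change in that feature rather than its absolute next value, so experience from a few visited states carries over to states never visited.

```python
import math
from collections import Counter

# Hypothetical toy domain: each action shifts (x, y) by a fixed offset.
ACTIONS = {0: (1, 0), 1: (-1, 0), 2: (0, 1), 3: (0, -1)}  # E, W, N, S


def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def build_tree(rows, labels):
    """Grow a small classification tree using equality tests on features."""
    if len(set(labels)) == 1:
        return labels[0]  # pure leaf
    best_gain, best_split = 1e-9, None  # ignore numerically zero gains
    for f in range(len(rows[0])):
        for v in sorted(set(r[f] for r in rows)):
            eq = [i for i, r in enumerate(rows) if r[f] == v]
            ne = [i for i, r in enumerate(rows) if r[f] != v]
            if not eq or not ne:
                continue
            remainder = sum(len(part) * entropy([labels[i] for i in part])
                            for part in (eq, ne)) / len(rows)
            gain = entropy(labels) - remainder
            if gain > best_gain:
                best_gain, best_split = gain, (f, v, eq, ne)
    if best_split is None:
        return Counter(labels).most_common(1)[0][0]  # majority leaf
    f, v, eq, ne = best_split
    return {
        "feature": f,
        "value": v,
        "eq": build_tree([rows[i] for i in eq], [labels[i] for i in eq]),
        "ne": build_tree([rows[i] for i in ne], [labels[i] for i in ne]),
    }


def predict(tree, row):
    """Walk from the root to a leaf and return the predicted label."""
    while isinstance(tree, dict):
        branch = "eq" if row[tree["feature"]] == tree["value"] else "ne"
        tree = tree[branch]
    return tree


# Experience gathered in only three states (assumed, for illustration).
visited = [(1, 1), (2, 2), (1, 3)]
rows, dx_labels, dy_labels = [], [], []
for (x, y) in visited:
    for a, (dx, dy) in ACTIONS.items():
        rows.append((x, y, a))   # input: state features plus action
        dx_labels.append(dx)     # target: relative change in x
        dy_labels.append(dy)     # target: relative change in y

dx_tree = build_tree(rows, dx_labels)
dy_tree = build_tree(rows, dy_labels)


def predict_next(state, action):
    """Predict the next state as current state plus predicted change."""
    row = (state[0], state[1], action)
    return (state[0] + predict(dx_tree, row),
            state[1] + predict(dy_tree, row))


print(predict_next((3, 2), 0))  # query a state that was never visited
```

Because the trees are trained on changes in each feature, they end up splitting on the action alone in this toy domain, which is why a prediction for an unvisited state is possible. A model that predicted absolute next states would have nothing to say about (3, 2) until the agent had been there.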
