Abstract
The behaviour of reinforcement learning (RL) algorithms is best understood in completely observable, finite state- and action-space, discrete-time controlled Markov chains. Robot-learning domains, on the other hand, are inherently continuous both in time and in space, and moreover they are only partially observable. In this article we suggest a systematic design method whose motivation comes from the desire to transform the task to be solved into a finite-state, discrete-time, “approximately” Markovian task that is also completely observable. The key idea is to break the problem into subtasks and design controllers for each subtask. Operating conditions are then attached to the controllers (a controller together with its operating condition is called a module), and possibly additional features are designed to facilitate observability. A new discrete time-counter is introduced at the “module level” that ticks only when a change in the value of one of the features is observed. The approach was tried out on a real-life robot. Several RL algorithms were compared, and a model-based approach was found to work best. The learnt switching strategy performed as well as a handcrafted version. Moreover, the learnt strategy appeared to exploit certain properties of the environment that could not have been anticipated in advance, suggesting the promising possibility that a learnt controller might outperform a handcrafted switching strategy in the future.
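The module-level view described above can be made concrete with a minimal sketch: controllers paired with operating conditions (“modules”), hand-designed discrete features, and a module-level clock that only ticks when a feature value changes. Everything in the sketch (the 1-D toy task, the feature thresholds, the module names, the fixed switching rule standing in for the learnt RL policy) is an illustrative assumption, not the authors' robot task or implementation.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

@dataclass
class Module:
    name: str
    controller: Callable[[float], float]          # maps state -> low-level action
    operating_condition: Callable[[float], bool]  # where the controller is usable

# Toy task (assumed for illustration): drive a 1-D position x towards x = 0.
approach = Module("approach", lambda x: -0.1 if x > 0 else 0.1, lambda x: abs(x) > 0.5)
dock     = Module("dock",     lambda x: -0.02 if x > 0 else 0.02, lambda x: abs(x) <= 0.5)
modules = [approach, dock]

def features(x: float) -> Tuple[str, str]:
    """Hand-designed discrete features; their values define the module-level state."""
    return ("far" if abs(x) > 0.5 else "near", "left" if x < 0 else "right")

def switch(feat: Tuple[str, str], x: float) -> Module:
    """Switching policy: a fixed rule here; in the paper this choice is learnt with RL."""
    applicable = [m for m in modules if m.operating_condition(x)]
    return applicable[0] if applicable else modules[0]

x, feat, active = 3.0, None, None
for t in range(200):
    new_feat = features(x)
    if new_feat != feat:            # the module-level clock ticks only on a feature change
        feat = new_feat
        active = switch(feat, x)
    x += active.controller(x)       # primitive (low-level) time step
    if abs(x) < 0.05:
        break
print(f"reached |x|={abs(x):.3f} after {t + 1} primitive steps, last module={active.name}")
```

In this sketch the switching decision is re-evaluated only a handful of times per episode, even though the low-level controller runs at every primitive step; this is the sense in which the module-level task becomes a small, discrete, approximately Markovian problem suitable for standard RL.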