Abstract
Imitation is a form of social learning in which an individual observes and copies another's actions. This paper presents a new method that uses imitation to enhance the learning speed of individual agents employing a well-known reinforcement learning algorithm, Q-learning. Compared with other research combining imitation with reinforcement learning, our method relies purely on imitation of observed behaviours, with no access to other agents' internal states and no sharing of experiences between agents. The paper evaluates this imitation-enhanced reinforcement learning approach both in simulation and with real robots operating in continuous space. Results from both settings show that the learning speed of the group is improved.
Highlights
Social learning, which enables individuals to learn from others in a community, is an important mechanism for social animals
Imitation learning differs from other adaptive learning algorithms used in robotic research, including reinforcement learning (Barto et al., 2004), evolutionary algorithms (Nolfi and Floreano, 2000) and supervised learning (Rumelhart et al., 1986), in that learning by imitation is based upon social interactions
This paper presents a simple method for linking reinforcement learning with imitation
Summary
Social learning, which enables individuals to learn from others in a community, is an important mechanism for social animals. Imitation learning differs from other adaptive learning algorithms used in robotic research, including reinforcement learning (Barto et al., 2004), evolutionary algorithms (Nolfi and Floreano, 2000) and supervised learning (Rumelhart et al., 1986), in that learning by imitation is based upon social interactions. Another important aspect of imitation is that the only information transferred between agents is the set of observed actions. Compared with other research combining imitation with reinforcement learning, our method relies purely on imitation of observed behaviours, with no access to other agents' internal states and no sharing of experiences between agents. Both simulation and real-robot experimental results show that the learning speed of the agents is improved.
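As a rough illustration of the idea, the sketch below combines a standard tabular Q-learning update with action selection that sometimes copies a demonstrator's observed action instead of exploring at random. This is not the paper's exact mechanism; the function names, the imitation probability `p_imitate`, and the toy chain environment are illustrative assumptions only.

```python
import random

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Standard Q-learning update rule."""
    best_next = max(Q[s_next].values())
    Q[s][a] += alpha * (r + gamma * best_next - Q[s][a])

def choose_action(Q, s, observed=None, epsilon=0.2, p_imitate=0.5):
    """Epsilon-greedy action selection, biased toward an observed action.

    Only the demonstrator's visible action is used -- no internal state
    or experience is shared, mirroring the constraint in the paper.
    """
    acts = list(Q[s].keys())
    if observed is not None and random.random() < p_imitate:
        return observed                      # imitate the observed action
    if random.random() < epsilon:
        return random.choice(acts)           # explore
    return max(acts, key=lambda a: Q[s][a])  # exploit

# Toy 1-D chain: states 0..3, reward at state 3; actions -1 (left), +1 (right).
states, actions = range(4), (-1, +1)
Q = {s: {a: 0.0 for a in actions} for s in states}
random.seed(0)
for episode in range(200):
    s = 0
    while s != 3:
        # Hypothetical demonstrator always moves right; the learner
        # observes that action and sometimes copies it.
        a = choose_action(Q, s, observed=+1)
        s_next = min(max(s + a, 0), 3)
        r = 1.0 if s_next == 3 else 0.0
        q_update(Q, s, a, r, s_next)
        s = s_next
```

Because imitated actions steer exploration toward behaviour that reaches the reward, the Q-values for the demonstrated action are reinforced faster than they would be under purely random exploration, which is the intuition behind the claimed speed-up.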