Abstract

Reinforcement learning applications to real robots in multi-agent dynamic environments are limited by the huge exploration space and the enormously long learning time. A typical example is RoboCup competition, where other agents and their behaviors easily cause state and action space explosion. This paper presents a method that utilizes the state value functions of macro actions to explore appropriate behavior efficiently in a multi-agent environment, by which the learning agent can acquire cooperative behavior with its teammates and competitive behavior against its opponents. The key ideas are as follows. First, the agent learns a few macro actions and their state value functions by reinforcement learning beforehand. Second, an appropriate initial controller for learning cooperative behavior is generated from these state value functions. The initial controller uses the state values of the macro actions so that the learner tends to select good macro actions and avoid useless ones. By combining these ideas with a two-layer hierarchical system, the proposed method shows better performance during learning than conventional methods. This paper presents a case study of a 4 (defense team) versus 5 (offense team) game task, in which the learning agent (a passer on the offense team) successfully acquired teamwork plays (pass and shoot) within a shorter learning time.
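The idea of an initial controller biased by pretrained state values can be sketched as follows. This is a hypothetical illustration, not the authors' implementation: the function names, the softmax selection rule, and the `value_fns` interface are all assumptions chosen to show how higher-valued macro actions can be favored at the start of learning.

```python
import math
import random

def select_macro(state, value_fns, temperature=1.0, rng=random):
    """Pick a macro action by softmax over its pretrained state values.

    value_fns: dict mapping macro-action name -> callable V_m(state),
    where each V_m was learned beforehand by reinforcement learning.
    Macros with higher state value are selected more often, so the
    learner starts from a sensible controller instead of exploring
    uniformly over all macro actions (a hypothetical sketch).
    """
    names = list(value_fns)
    values = [value_fns[m](state) / temperature for m in names]
    mx = max(values)
    weights = [math.exp(v - mx) for v in values]  # numerically stable softmax
    total = sum(weights)
    probs = [w / total for w in weights]
    # Sample one macro action according to the softmax probabilities.
    r = rng.random()
    acc = 0.0
    for name, p in zip(names, probs):
        acc += p
        if r <= acc:
            return name
    return names[-1]
```

Lowering `temperature` makes the controller greedier toward the highest-valued macro; raising it restores exploration, which is one plausible way to trade off the prior value estimates against further learning.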

