강화 학습에 기초한 로봇 축구 에이전트의 설계 및 구현

In-Cheol Kim

doi:10.3745/kipstb.2002.9b.2.139

Abstract

The robot soccer simulation game is a dynamic multi-agent environment. In this paper we suggest a new reinforcement learning approach to each agent`s dynamic positioning in such dynamic environment. Reinforcement learning is the machine learning in which an agent learns from indirect, delayed reward an optimal policy to choose sequences of actions that produce the greatest cumulative reward. Therefore the reinforcement learning is different from supervised learning in the sense that there is no presentation of input-output pairs as training examples. Furthermore, model-free reinforcement learning algorithms like Q-learning do not require defining or learning any models of the surrounding environment. Nevertheless these algorithms can learn the optimal policy if the agent can visit every state-action pair infinitely. However, the biggest problem of monolithic reinforcement learning is that its straightforward applications do not successfully scale up to more complex environments due to the intractable large space of states. In order to address this problem, we suggest Adaptive Mediation-based Modular Q-Learning (AMMQL) as an improvement of the existing Modular Q-Learning (MQL). While simple modular Q-learning combines the results from each learning module in a fixed way, AMMQL combines them in a more flexible way by assigning different weight to each module according to its contribution to rewards. Therefore in addition to resolving the problem of large state space effectively, AMMQL can show higher adaptability to environmental changes than pure MQL. In this paper we use the AMMQL algorithn as a learning method for dynamic positioning of the robot soccer agent, and implement a robot soccer agent system called Cogitoniks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

강화 학습에 기초한 로봇 축구 에이전트의 설계 및 구현

Abstract

Talk to us

Similar Papers

More From: The KIPS Transactions:PartB

Lead the way for us

Similar Papers

Reinforcement Learning in System Identification
Mariela Cerrada ... Jose Aguilar
-
Mariela Cerrada, et. al.Mariela Cerrada ... Jose Aguilar
01 Jan 2008
01 Jan 2008

PMA-DRL: A parallel model-augmented framework for deep reinforcement learning algorithms
Xufang Luo ... Yunhong Wang
Neurocomputing | VOL. 403
Xufang Luo, et. al.Xufang Luo ... Yunhong Wang
25 Apr 2020
Neurocomputing | VOL. 403

Artificial Intelligence and the Common Sense of Animals.
Murray Shanahan ... Benjamin Beyret
Trends in Cognitive Sciences | VOL. 24
Murray Shanahan, et. al.Murray Shanahan ... Benjamin Beyret
08 Oct 2020
Trends in Cognitive Sciences | VOL. 24

Reinforcement Learning Based Decision Making of Operational Indices in Process Industry Under Changing Environment
Chao Liu ... Jiyuan Sun
IEEE Transactions on Industrial Informatics | VOL. 17
Chao Liu, et. al.Chao Liu ... Jiyuan Sun
29 Jun 2020
IEEE Transactions on Industrial Informatics | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

강화 학습에 기초한 로봇 축구 에이전트의 설계 및 구현

Abstract

Talk to us

Similar Papers

More From: The KIPS Transactions:PartB