Multi-Virtual-Agent Reinforcement Learning for a Stochastic Predator-Prey Grid Environment

Yanbin Lin, Xiangnan Zhong, Zhen Ni

doi:10.1109/ijcnn55064.2022.9891898

Abstract

Generalization is a crucial problem in reinforcement learning, especially in dynamic environments. Conventional reinforcement learning methods rely on idealized assumptions and are difficult to apply directly in dynamic environments. In this paper, we propose a new multi-virtual-agent reinforcement learning (MVARL) approach for a predator-prey grid game. The designed method can find the optimal solution even when the predator moves. Specifically, we design virtual agents that interact with simulated changing environments in parallel, instead of using actual agents. Moreover, a global agent learns from these virtual agents while interacting with the actual environment at the same time. This method can not only effectively improve the generalization performance of reinforcement learning in dynamic environments but also reduce the overall computational cost. Two simulation studies are considered to validate the effectiveness of the designed method, and the results are compared with those of conventional reinforcement learning methods. The results indicate that the proposed method can improve the robustness of reinforcement learning and contributes to generalization to a certain extent.
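The abstract does not specify the environment, the learning rule, or how the global agent combines what the virtual agents learn, so the following is only a minimal sketch under stated assumptions and not the authors' implementation. It assumes tabular Q-learning, a hypothetical 4x4 grid in which the learner tries to reach a goal cell while avoiding a predator that moves randomly with probability `p_move`, and a global table formed by averaging the virtual agents' Q-values. All names, rewards, and hyperparameters are illustrative, and the virtual agents are trained one after another rather than in parallel.

```python
import random

SIZE = 4                                       # hypothetical grid side length
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]   # right, left, down, up
GOAL = (SIZE - 1, SIZE - 1)                    # assumed goal cell for the learner


def clip(v):
    """Keep a coordinate inside the grid."""
    return max(0, min(SIZE - 1, v))


def move(pos, action):
    """Apply a move; walking into a wall leaves the position unchanged."""
    return (clip(pos[0] + action[0]), clip(pos[1] + action[1]))


def step(agent, predator, action, p_move, rng):
    """One simulated transition; the predator moves randomly with probability p_move."""
    agent = move(agent, action)
    if rng.random() < p_move:
        predator = move(predator, rng.choice(ACTIONS))
    if agent == predator:
        return agent, predator, -10.0, True    # caught (assumed penalty)
    if agent == GOAL:
        return agent, predator, 10.0, True     # reached the goal (assumed reward)
    return agent, predator, -1.0, False        # assumed per-step cost


def train_virtual_agent(p_move, episodes, seed, alpha=0.1, gamma=0.9, eps=0.2):
    """Tabular Q-learning for one virtual agent in its own simulated environment."""
    rng = random.Random(seed)
    q = {}
    for _ in range(episodes):
        agent, predator = (0, 0), (SIZE // 2, SIZE // 2)
        for _ in range(50):                    # cap on episode length
            values = q.setdefault((agent, predator), [0.0] * len(ACTIONS))
            if rng.random() < eps:             # epsilon-greedy exploration
                a = rng.randrange(len(ACTIONS))
            else:
                a = values.index(max(values))
            agent, predator, reward, done = step(agent, predator, ACTIONS[a], p_move, rng)
            nxt = q.setdefault((agent, predator), [0.0] * len(ACTIONS))
            target = reward if done else reward + gamma * max(nxt)
            values[a] += alpha * (target - values[a])
            if done:
                break
    return q


def merge(q_tables):
    """Assumed aggregation: average each state's Q-values over the tables that contain it."""
    merged = {}
    for state in set().union(*q_tables):
        rows = [q[state] for q in q_tables if state in q]
        merged[state] = [sum(col) / len(rows) for col in zip(*rows)]
    return merged


# Each virtual agent sees a different (assumed) level of predator movement.
virtual_tables = [train_virtual_agent(p, episodes=200, seed=i)
                  for i, p in enumerate((0.0, 0.3, 0.6))]
global_q = merge(virtual_tables)
start = ((0, 0), (SIZE // 2, SIZE // 2))
best = global_q[start].index(max(global_q[start]))
print("greedy first move of the global agent:", ACTIONS[best])
```

Averaging is only one possible way to share information between virtual agents and a global agent. The sketch also leaves out the global agent's own interaction with the actual environment, which the abstract describes as happening at the same time.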
