Generative subgoal oriented multi-agent reinforcement learning through potential field

Shengze Li,Hao Jiang,Yuntao Liu,Jieyuan Zhang,Xinhai Xu,Donghong Liu

doi:10.1016/j.neunet.2024.106552

Abstract

Multi-agent reinforcement learning (MARL) effectively improves the learning speed of agents in sparse reward tasks with the guide of subgoals. However, existing works sever the consistency of the learning objectives of the subgoal generation and subgoal reached stages, thereby significantly inhibiting the effectiveness of subgoal learning. To address this problem, we propose a novel Potential field Subgoal-based Multi-Agent reinforcement learning (PSMA) method, which introduces the potential field (PF) to unify the two-stage learning objectives. Specifically, we design a state-to-PF representation model that describes agents’ states as potential fields, allowing easy measurement of the interaction effect for both allied and enemy agents. With the PF representation, a subgoal selector is designed to automatically generate multiple subgoals for each agent, drawn from the experience replay buffer that contains both individual and total PF values. Based on the determined subgoals, we define an intrinsic reward function to guide the agent to reach their respective subgoals while maximizing the joint action-value. Experimental results show that our method outperforms the state-of-the-art MARL method on both StarCraft II micro-management (SMAC) and Google Research Football (GRF) tasks with sparse reward settings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generative subgoal oriented multi-agent reinforcement learning through potential field

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Journal: Neural Networks	Publication Date: Jul 17, 2024
License type: cc-by-nc-nd

Similar Papers

Lessons learned in single-agent and multiagent learning with robot foraging
Z Ren ... A.B Williams
-
Z Ren, et. al.Z Ren ... A.B Williams
10 Nov 2003
10 Nov 2003

SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward
Xin He ... Hongwei Ge
-
Xin He, et. al.Xin He ... Hongwei Ge
01 Aug 2024
01 Aug 2024

Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method.
Guang Liao ... Junan Yang
Sensors (Basel, Switzerland) | VOL. 24
Guang Liao, et. al.Guang Liao ... Junan Yang
25 Oct 2024
Sensors (Basel, Switzerland) | VOL. 24

Graph-based multi-agent reinforcement learning for collaborative search and tracking of multiple UAVs
Bocheng Zhao ... Shaohai Wang
Chinese Journal of Aeronautics | VOL. -
Bocheng Zhao, et. al.Bocheng Zhao ... Shaohai Wang
01 Aug 2024
Chinese Journal of Aeronautics | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generative subgoal oriented multi-agent reinforcement learning through potential field

Abstract

Talk to us

Similar Papers

More From: Neural Networks