Abstract

Multiagent cooperation is one of the most attractive research fields in multiagent systems, and researchers have made many attempts to promote cooperative behavior. However, several issues remain, such as complex interactions among different groups of agents and redundant communication content from irrelevant agents, which hinder the learning and convergence of cooperative behaviors. To address these limitations, a novel method called multiagent hierarchical cognition difference policy (MA-HCDP) is proposed in this paper. It includes a hierarchical group network (HGN), a cognition difference network (CDN), and a soft communication network (SCN). HGN is designed to distinguish the underlying information in the observations of diverse groups (friendly, enemy, and object groups) and to extract a separate high-dimensional state representation for each group. CDN is designed based on a variational auto-encoder to allow each agent to choose its neighbors (communication targets) adaptively according to its environment cognition difference. SCN is designed to handle the complex interactions among the agents with a soft attention mechanism. Simulation results demonstrate the superior effectiveness of our method compared with existing methods.
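As a rough illustration only (not the authors' implementation), the sketch below shows how a VAE-style cognition encoder, a KL-based cognition difference, and soft-attention communication could be wired together in PyTorch. The layer sizes, the symmetrised-KL choice, and the threshold `tau` are all assumptions; in particular, whether a larger or smaller cognition difference should trigger communication is a design choice that the abstract alone does not pin down.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CognitionEncoder(nn.Module):
    """VAE-style encoder: maps an agent's observation to a diagonal Gaussian
    latent (mu, logvar) that serves as its 'environment cognition'."""
    def __init__(self, obs_dim, latent_dim, hidden=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        self.mu = nn.Linear(hidden, latent_dim)
        self.logvar = nn.Linear(hidden, latent_dim)

    def forward(self, obs):
        h = self.net(obs)
        return self.mu(h), self.logvar(h)

def kl_gauss(mu_p, logvar_p, mu_q, logvar_q):
    """KL(p || q) between two diagonal Gaussians, summed over latent dims."""
    var_p, var_q = logvar_p.exp(), logvar_q.exp()
    return 0.5 * ((var_p + (mu_p - mu_q) ** 2) / var_q
                  - 1.0 + logvar_q - logvar_p).sum(-1)

def select_neighbors(mu, logvar, tau):
    """Pairwise cognition difference (symmetrised KL between agents' latents).
    Here we assume that a difference above the hypothetical threshold `tau`
    marks agent j as a communication target of agent i (mask[i, j] = 1)."""
    n = mu.shape[0]
    diff = torch.zeros(n, n)
    for i in range(n):
        for j in range(n):
            if i != j:
                diff[i, j] = 0.5 * (kl_gauss(mu[i], logvar[i], mu[j], logvar[j])
                                    + kl_gauss(mu[j], logvar[j], mu[i], logvar[i]))
    return (diff > tau).float()

class SoftCommunication(nn.Module):
    """Soft-attention aggregation of the other agents' hidden states,
    restricted to the mask produced by the cognition-difference step."""
    def __init__(self, hidden_dim):
        super().__init__()
        self.query = nn.Linear(hidden_dim, hidden_dim)
        self.key = nn.Linear(hidden_dim, hidden_dim)
        self.value = nn.Linear(hidden_dim, hidden_dim)

    def forward(self, h, mask):
        # h: (n_agents, hidden_dim); mask: (n_agents, n_agents), 1 = communicate
        scores = self.query(h) @ self.key(h).t() / h.shape[-1] ** 0.5
        scores = scores.masked_fill(mask == 0, float('-inf'))
        attn = F.softmax(scores, dim=-1)
        attn = torch.nan_to_num(attn)  # agents with no neighbors receive no message
        return attn @ self.value(h)
```

A usage sketch under the same assumptions: encode every agent's observation, build the neighbor mask from the pairwise differences, then aggregate messages with the soft-attention module before feeding the result to each agent's policy.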

Highlights

  • Based on the common paradigm of centralized learning with decentralized execution, some multiagent reinforcement learning (MARL) algorithms learn centralized critics for multiple agents and determine decentralized actions for execution

  • Although TRANSFER considers the influence of communication among different agents, it ignores the influence of redundant communication, which makes agents trained with TRANSFER obtain higher rewards than multiagent deep deterministic policy gradient (MADDPG) but converge more slowly


Summary

Introduction

Based on the common paradigm of centralized learning with decentralized execution, some MARL algorithms learn centralized critics for multiple agents and determine decentralized actions. When these methods are applied to environments with a large number of agents, they have limitations. Agents need to cooperate with each other to complete different tasks in partially observable environments, which are modeled as partially observable Markov games, an extension of Markov games [23]. They are defined by the environment state $S^t$, the action space $A^t = \{a_1^t, \cdots, a_N^t\}$, where $N$ is the number of agents and $a_i^t$ is the action of agent $i$ at time $t$, and the observation space $O^t = \{o_1^t, \cdots, o_N^t\}$. Each agent $i$ learns a policy conditioned on its own observation.
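For concreteness, here is a minimal sketch of the centralized-critic / decentralized-actor structure referred to above, in the spirit of MADDPG-style methods rather than the specific MA-HCDP networks; the module names and sizes are placeholders.

```python
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Decentralized actor: agent i maps only its own observation o_i^t to an
    action, so execution needs no global information."""
    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, act_dim), nn.Tanh())

    def forward(self, obs_i):
        return self.net(obs_i)

class CentralizedCritic(nn.Module):
    """Centralized critic used only during training: it conditions on the
    joint observations and joint actions of all N agents."""
    def __init__(self, obs_dim, act_dim, n_agents, hidden=128):
        super().__init__()
        in_dim = n_agents * (obs_dim + act_dim)
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1))

    def forward(self, all_obs, all_acts):
        # all_obs: (batch, N, obs_dim); all_acts: (batch, N, act_dim)
        x = torch.cat([all_obs.flatten(1), all_acts.flatten(1)], dim=-1)
        return self.net(x)
```

During training the critic sees the joint observation-action of all $N$ agents; at execution time only the per-agent actors are used, each conditioned on its local observation.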

