A semi-independent policies training method with shared representation for heterogeneous multi-agents reinforcement learning.

Biao Zhao, Zhang Chen, Yucheng Guo, Weiqiang Jin

doi:10.3389/fnins.2023.1201370

Abstract

Humans do not learn everything from scratch; they connect and associate incoming information with exchanged experience and existing knowledge. This idea extends to cooperative multi-agent reinforcement learning, where it has succeeded for homogeneous agents by means of parameter sharing. However, parameter sharing is difficult to apply directly to heterogeneous agents because of their individual input/output forms and their diverse functions and targets. Neuroscience has provided evidence that the brain creates several levels of experience- and knowledge-sharing mechanisms that not only exchange similar experiences but also allow abstract concepts to be shared, so that unfamiliar situations can be handled when others have already encountered them. Inspired by these brain functions, we propose a semi-independent policy training method that tackles the conflict between parameter sharing and specialized training for heterogeneous agents. It employs a shared common representation for both observation and action, enabling the integration of various input and output sources. Additionally, a shared latent space is used to maintain a balanced relationship between the upstream policy and downstream functions, benefiting each individual agent's target. Experiments show that our proposed method outperforms current mainstream algorithms, especially when handling heterogeneous agents. Empirically, our proposed method can also be extended into a more general and fundamental reinforcement learning structure for heterogeneous agents, supporting curriculum learning and representation transfer. All our code is openly released at https://gitlab.com/reinforcement/ntype.
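
To make the idea of semi-independent policies over a shared latent space more concrete, the following is a minimal forward-pass sketch in plain Python. It is not the authors' implementation, and the abstract does not specify the architecture: the class names (`SharedTrunk`, `AgentPolicy`), the agent names, the layer sizes, and the choice of a single shared linear layer are all assumptions made for illustration. Each agent keeps a private encoder and a private action head sized to its own observation and action spaces, while one trunk object is shared by all agents. Training is omitted.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Random weight matrix with a simple 1/sqrt(cols) scale (illustrative choice)."""
    scale = 1.0 / math.sqrt(cols)
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Matrix-vector product on plain Python lists."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def tanh_vec(vector):
    return [math.tanh(x) for x in vector]


def softmax(logits):
    """Numerically stable softmax."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class SharedTrunk:
    """Hypothetical shared component: one set of weights used by every agent."""

    def __init__(self, latent_dim, rng):
        self.weights = make_matrix(latent_dim, latent_dim, rng)

    def forward(self, latent):
        return tanh_vec(matvec(self.weights, latent))


class AgentPolicy:
    """Hypothetical per-agent policy: private encoder and head around the shared trunk."""

    def __init__(self, obs_dim, n_actions, trunk, latent_dim, rng):
        self.encoder = make_matrix(latent_dim, obs_dim, rng)  # agent-specific input map
        self.head = make_matrix(n_actions, latent_dim, rng)   # agent-specific output map
        self.trunk = trunk                                    # same object for all agents

    def action_probs(self, obs):
        latent = tanh_vec(matvec(self.encoder, obs))  # project into the common latent space
        shared = self.trunk.forward(latent)           # shared processing
        return softmax(matvec(self.head, shared))     # agent-specific action distribution


rng = random.Random(0)
LATENT_DIM = 8  # assumed size of the common latent space
trunk = SharedTrunk(LATENT_DIM, rng)

# Two heterogeneous agents with different observation and action sizes.
scout = AgentPolicy(obs_dim=4, n_actions=3, trunk=trunk, latent_dim=LATENT_DIM, rng=rng)
carrier = AgentPolicy(obs_dim=6, n_actions=5, trunk=trunk, latent_dim=LATENT_DIM, rng=rng)

scout_probs = scout.action_probs([0.1, -0.2, 0.3, 0.0])
carrier_probs = carrier.action_probs([0.5, 0.1, -0.4, 0.2, 0.0, 0.9])
```

In this sketch, a gradient update to `trunk.weights` would affect both agents, whereas updates to an agent's `encoder` or `head` stay private to that agent. That split is one possible reading of "semi-independent" and may differ from the structure used in the released code.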

Journal: Frontiers in neuroscience	Publication Date: Jun 19, 2023
Citations: 3	License type: CC BY 4.0
