Decentralized multi-agent cooperation via adaptive partner modeling

Chenhang Xu,Jia Wang,Xiaohui Zhu,Yong Yue,Weifeng Zhou,Zhixuan Liang,Dominik Wojtczak

doi:10.1007/s40747-024-01421-3

Abstract

Multi-agent reinforcement learning encounters a non-stationary challenge, where agents concurrently update their policies, leading to changes in the environment. Existing approaches have tackled this challenge through communication among agents to obtain their partners’ actions, but this introduces computational complexity known as partner sample complexity. An alternative approach is to develop partner models that generate samples instead of direct communication to mitigate this complexity. However, a discrepancy arises between the real policies distribution and the policy of partner models, termed as model bias, which can significantly impact performance when heavily relying on partner models. In order to achieve a trade-off between sample complexity and performance, a novel multi-agent model-based reinforcement learning algorithm called decentralized adaptive partner modeling (DAPM) is proposed, which utilizes fictitious self play (FSP) to construct partner models and update policies. Model bias is addressed by establishing an upper bound to restrict the usage of partner models. Coupled with that, an adaptive rollout approach is introduced, enabling real agents to dynamically communicate with partner models based on their quality, ensuring that agent performance can progressively improve with partner model samples. The effectiveness of DAPM is exhibited in two multi-agent tasks, showing that DAPM outperforms existing model-free algorithms in terms of partner sample complexity and training stability. Specifically, DAPM requires 28.5% fewer communications compared to the best baseline and exhibits reduced fluctuations in the learning curve, indicating superior performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Decentralized multi-agent cooperation via adaptive partner modeling

Abstract

Talk to us

Similar Papers

More From: Complex & Intelligent Systems

Lead the way for us

Journal: Complex & Intelligent Systems	Publication Date: Apr 15, 2024
License type: CC BY 4.0

Similar Papers

A Data-Efficient Reinforcement Learning Method Based on Local Koopman Operators
Lixing Song ... Junheng Wang
-
Lixing Song, et. al.Lixing Song ... Junheng Wang
01 Dec 2021
01 Dec 2021

Adaptive machine learning models: Concepts for real-time financial fraud prevention in dynamic environments
Halima Oluwabunmi Bello ... Maxwell Nana Ameyaw
World Journal of Advanced Engineering Technology and Sciences | VOL. 12
Halima Oluwabunmi Bello, et. al. Halima Oluwabunmi Bello ... Maxwell Nana Ameyaw
30 Jul 2024
World Journal of Advanced Engineering Technology and Sciences | VOL. 12

Variational Model-based Policy Optimization
Yinlam Chow ... Mohammad Ghavamzadeh
-
Yinlam Chow, et. al.Yinlam Chow ... Mohammad Ghavamzadeh
01 Aug 2021
01 Aug 2021

Improving Model-Based Deep Reinforcement Learning with Learning Degree Networks and Its Application in Robot Control
Guoqing Ma ... Zhifu Wang
Journal of Robotics | VOL. 2022
Guoqing Ma, et. al.Guoqing Ma ... Zhifu Wang
04 Mar 2022
Journal of Robotics | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decentralized multi-agent cooperation via adaptive partner modeling

Abstract

Talk to us

Similar Papers

More From: Complex & Intelligent Systems