Abstract
In this paper, we consider a distributed reinforcement learning setting in which agents communicate with a central entity in a shared environment to maximize a global reward. A main challenge in this setting is that the randomness of the wireless channel perturbs each agent’s model update, while multiple agents’ updates may interfere with one another when communicating under limited bandwidth. To address this issue, we propose a novel distributed reinforcement learning algorithm based on the alternating direction method of multipliers (ADMM) and “<i>over-the-air aggregation</i>” using an analog transmission scheme, referred to as A-RLADMM. Our algorithm incorporates the wireless channel into the formulation of the ADMM method, which enables agents to transmit each element of their updated models over the same channel using analog communication. Numerical experiments on a multi-agent collaborative navigation task show that our proposed algorithm significantly outperforms the digital communication baseline of A-RLADMM (D-RLADMM), the lazily aggregated policy gradient (RL-LAPG), and the analog and digital communication versions of vanilla FL, (A-FRL) and (D-FRL) respectively.
Highlights
Owing to the stringent requirements of 5G-and-beyond applications such as Industry 4.0, network edge intelligence is of paramount importance [1].
We focus on the fully cooperative setting, which represents a large portion of multi-agent reinforcement learning (MARL) settings, where multiple agents interact in a shared environment and collaborate to maximize their rewards.
Simulation results show that our proposed algorithm significantly outperforms the digital communication version of A-RLADMM (D-RLADMM), the lazily aggregated policy gradient (RL-LAPG), the digital communication version of vanilla FL (D-FRL), as well as the analog version of FL (A-FRL), since it significantly reduces the number of communication uploads.
Summary
Owing to the stringent requirements of 5G-and-beyond applications such as Industry 4.0, network edge intelligence is of paramount importance [1]. One key challenge in these applications is how to optimize distributed systems in which different entities (agents) communicate wirelessly in the same environment and share limited communication resources (e.g., limited bandwidth). We focus on the fully cooperative setting, which represents a large portion of multi-agent reinforcement learning (MARL) settings, where multiple agents interact in a shared environment and collaborate to maximize their rewards. MARL entails sequential decision making, where agents take actions over sequences of time steps in a stochastic environment. Unlike learning settings where the data distribution is stationary, the distribution used to sample data in the RL setting depends on time-varying policy parameters, which introduces non-stationarity and makes the problem more challenging. Many MARL algorithms have been proposed to solve real-world problems such as spectrum sharing [2], 360-degree video streaming [3], multiplayer gaming [4], and robot navigation [5].
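To make the over-the-air aggregation idea mentioned in the abstract concrete, the following is a minimal, hedged sketch (not the paper's exact A-RLADMM scheme): each agent pre-scales its analog model update by the inverse of its channel gain, the multiple-access channel superimposes the transmitted signals, and the server recovers an approximate average update from the noisy sum. The channel gains, noise level, and perfect channel-state knowledge here are illustrative assumptions, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n_agents, dim = 4, 8

# Each agent holds a local model update (illustrative random vectors).
updates = [rng.normal(size=dim) for _ in range(n_agents)]

# Assumed known channel gains; agents invert them before transmitting
# so their signals add up coherently at the receiver.
gains = rng.uniform(0.5, 1.5, size=n_agents)
tx = [u / h for u, h in zip(updates, gains)]

# The wireless multiple-access channel applies each gain and
# superimposes all transmissions; receiver noise perturbs the sum.
noise = rng.normal(scale=0.01, size=dim)
received = sum(h * x for h, x in zip(gains, tx)) + noise

# The server divides by the number of agents to get the average
# update; as the noise vanishes this approaches the true mean.
avg_update = received / n_agents
true_avg = sum(updates) / n_agents
error = np.abs(avg_update - true_avg).max()
```

The key point is that aggregation happens "for free" in the channel: all agents transmit simultaneously over the same bandwidth, and the superposition itself computes the sum the server needs.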
More From: IEEE Transactions on Cognitive Communications and Networking