Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning

Delin Guo,Ying-Chang Liang,Xinggan Zhang,Lan Tang

doi:10.1109/tvt.2020.3020400

Abstract

In this paper, we study the handover (HO), and power allocation problem in a two-tier heterogeneous network (HetNet), which consists of a macro base station, and some millimeter-wave (mmWave) small base stations. We establish an HO management, and power allocation scheme to maximize the overall throughput while reducing the HO frequency. In particular, considering the interrelationship among decisions made by different user equipments (UEs), we first model the HO, and power allocation problem as a fully cooperative multi-agent task, in which all agents, i.e., UEs, have the same target. Then, to solve the multi-agent task, and get decentralized policies for each UE, we develop a multi-agent reinforcement learning (MARL) algorithm based on the proximal policy optimization (PPO) method, by introducing the centralized training with decentralized execution framework. That is, we use global information to train policies for each UE, and after the training is finished, each UE obtains a decentralized policy, which can be implemented only based on each UE's local observation. Specially, we introduce the counterfactual baseline to address the credit assignment problem in centralized learning. Due to the centralized training, the decentralized polices learned by multi-agent PPO (MAPPO) can work more cooperatively. Finally, the simulation results demonstrate that our method can achieve better performance comparing with other existing works.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Vehicular Technology

Lead the way for us

Journal: IEEE Transactions on Vehicular Technology	Publication Date: Sep 3, 2020
Citations: 125

Similar Papers

Unnecessary handover minimization in two-tier heterogeneous networks
Mohanad Alhabo ... Li Zhang
-
Mohanad Alhabo, et. al.Mohanad Alhabo ... Li Zhang
01 Feb 2017
01 Feb 2017

Rules-PPO-QMIX: Multi-Agent Reinforcement Learning with Mixed Rules for Large Scene Tasks
Zi-Zhen Shen ... Rui Yu
-
Zi-Zhen Shen, et. al.Zi-Zhen Shen ... Rui Yu
22 Oct 2021
22 Oct 2021

Optimization of Resource Allocation in Multi-Cell OFDM Systems: A Distributed Reinforcement Learning Approach
Yuntao Hu ... Mingzhe Chen
-
Yuntao Hu, et. al.Yuntao Hu ... Mingzhe Chen
01 Aug 2020
01 Aug 2020

Multi-Agent Reinforcement Learning Based Fully Decentralized Dynamic Time Division Configuration for 5G and B5G Network.
Xiangyu Chen ... Gang Chuai
Sensors (Basel, Switzerland) | VOL. 22
Xiangyu Chen, et. al.Xiangyu Chen ... Gang Chuai
23 Feb 2022
Sensors (Basel, Switzerland) | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Vehicular Technology