Deep Reinforcement Learning Evolution Algorithm for Dynamic Antenna Control in Multi-Cell Configuration HAPS System

Siyuan Yang, Kenji Hoshino, Wataru Takabatake, Yohei Shibata, Atsushi Nagate, Mondher Bouazizi, Tomoaki Ohtsuki

doi:10.3390/fi15010034

Open Access

https://doi.org/10.3390/fi15010034

Journal: Future Internet	Publication Date: Jan 12, 2023
Citations: 3	License type: CC BY 4.0

Affiliation: Keio University, SoftBank Group (Japan)

Abstract

In this paper, we propose a novel Deep Reinforcement Learning Evolution Algorithm (DRLEA) method that controls the antenna parameters of a High-Altitude Platform Station (HAPS) mobile system to reduce the number of low-throughput users. Random movement of the HAPS caused by wind can degrade user throughput. We therefore propose a method that dynamically adjusts the antenna parameters based on the throughput of users in the coverage area, reducing the number of low-throughput users by improving their throughput. Unlike conventional reinforcement learning methods such as the Deep Q Network (DQN), the proposed method combines the Evolution Algorithm (EA) with Reinforcement Learning (RL) to avoid sub-optimal solutions in each state. Moreover, we consider non-uniform user distribution scenarios, which are common in the real world, rather than idealized uniform ones. To evaluate the proposed method, we run simulations under four different real user distribution scenarios and compare it with the conventional EA and RL methods. The simulation results show that the proposed method effectively reduces the number of low-throughput users after the HAPS moves.

Full Text