Dynamic Resource Configuration for Low-Power IoT Networks: A Multi-Objective Reinforcement Learning Method

Yang Huang, Fuhui Zhou, Yijie Mao, Caiyong Hao

doi:10.1109/lcomm.2021.3074756

Open Access

Journal: IEEE Communications Letters
Publication Date: Jul 1, 2021
Citations: 8

Affiliation: Nanjing University of Aeronautics and Astronautics, Ministry of Industry and Information Technology, Southeast University, Imperial College London, State Radio Regulation Of China, Wuhan University

#Dynamic Resource Configuration #Multi-Objective Reinforcement Learning

Abstract

Considering grant-free transmissions in low-power IoT networks where the time-frequency distribution of interference is unknown, we address the problem of Dynamic Resource Configuration (DRC), which amounts to a Markov decision process. Unfortunately, off-the-shelf methods based on single-objective reinforcement learning cannot guarantee energy-efficient transmission, especially when all frequency-domain channels in a time interval suffer interference. Therefore, we propose a novel DRC scheme in which configuration policies are optimized within a Multi-Objective Reinforcement Learning (MORL) framework. Numerical results show that the average decision error rate achieved by the MORL-based DRC can be less than 12% of that yielded by the conventional R-learning-based approach.
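For readers unfamiliar with the R-learning baseline named in the abstract, the following is a minimal sketch of one common tabular form of the R-learning (average-reward) update. It is not the paper's MORL scheme, and it does not reproduce the paper's state, action, or reward definitions. The function name `r_learning_update`, the step sizes `alpha` and `beta`, and the interpretation of states and actions (for instance, an observed interference pattern and a chosen channel) are assumptions made purely for illustration.

```python
from typing import List


def r_learning_update(
    q: List[List[float]],   # q[state][action]: relative action values (hypothetical table)
    rho: float,             # running estimate of the average reward per step
    state: int,
    action: int,
    reward: float,
    next_state: int,
    alpha: float = 0.1,     # step size for the value table (assumed)
    beta: float = 0.01,     # step size for the average-reward estimate (assumed)
) -> float:
    """Apply one tabular R-learning step in place and return the updated rho."""
    best_next = max(q[next_state])
    # Average-reward TD step: the reward is measured relative to rho.
    q[state][action] += alpha * (reward - rho + best_next - q[state][action])
    best_here = max(q[state])
    # rho is adapted only when the action taken is greedy after the update.
    if q[state][action] == best_here:
        rho += beta * (reward - rho + max(q[next_state]) - best_here)
    return rho


# Toy usage with two states and two actions (all values are illustrative).
q_table = [[0.0, 0.0], [0.0, 0.0]]
rho_estimate = r_learning_update(
    q_table, 0.0, state=0, action=1, reward=1.0, next_state=1, alpha=0.5, beta=0.1
)
```

A single scalar reward drives both updates here, which is the sense in which such a baseline is single-objective; the abstract's point is that this can fail to guarantee energy-efficient transmission when every channel in a time interval is interfered.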
