Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV Based Random Access IoT Networks With NOMA

Sami Khairy,Lin X Cai,Prasanna Balaprakash,Yu Cheng

doi:10.1109/jsac.2020.3018804

Abstract

In this paper, we apply the Non-Orthogonal Multiple Access (NOMA) technique to improve the massive channel access of a wireless IoT network where solar-powered Unmanned Aerial Vehicles (UAVs) relay data from IoT devices to remote servers. Specifically, IoT devices contend for accessing the shared wireless channel using an adaptive $p$-persistent slotted Aloha protocol; and the solar-powered UAVs adopt Successive Interference Cancellation (SIC) to decode multiple received data from IoT devices to improve access efficiency. To enable an energy-sustainable capacity-optimal network, we study the joint problem of dynamic multi-UAV altitude control and multi-cell wireless channel access management of IoT devices as a stochastic control problem with multiple energy constraints. To learn an optimal control policy, we first formulate this problem as a Constrained Markov Decision Process (CMDP), and propose an online model-free Constrained Deep Reinforcement Learning (CDRL) algorithm based on Lagrangian primal-dual policy optimization to solve the CMDP. Extensive simulations demonstrate that our proposed algorithm learns a cooperative policy among UAVs in which the altitude of UAVs and channel access probability of IoT devices are dynamically and jointly controlled to attain the maximal long-term network capacity while maintaining energy sustainability of UAVs. The proposed algorithm outperforms Deep RL based solutions with reward shaping to account for energy costs, and achieves a temporal average system capacity which is $82.4\%$ higher than that of a feasible DRL based solution, and only $6.47\%$ lower compared to that of the energy-constraint-free system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Journal on Selected Areas in Communications	Publication Date: Nov 23, 2020
Citations: 100	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV Based Random Access IoT Networks With NOMA

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Communications

Lead the way for us

Similar Papers

Deep Reinforcement Learning for Aerial Data Collection in Hybrid-Powered NOMA-IoT Networks
Zhanpeng Zhang ... Runze Wu
IEEE Internet of Things Journal | VOL. 10
Zhanpeng Zhang, et. al.Zhanpeng Zhang ... Runze Wu
15 Jan 2023
IEEE Internet of Things Journal | VOL. 10

UAV Assisted Spectrum Sharing Ultra-Reliable and Low-Latency Communications
Zheng Chu ... Wanming Hao
-
Zheng Chu, et. al.Zheng Chu ... Wanming Hao
01 Dec 2019
01 Dec 2019

Antenna Selection and Device Grouping for Spectrum-Efficient UAV-Assisted IoT Systems
Dinh-Thuan Do ... Shahid Mumtaz
IEEE Internet of Things Journal | VOL. 10
Dinh-Thuan Do, et. al.Dinh-Thuan Do ... Shahid Mumtaz
01 May 2023
IEEE Internet of Things Journal | VOL. 10

Spectrally Efficient Uplink Cooperative NOMA With Joint Decoding for Relay-Assisted IoT Networks
Jeong Seon Yeom ... Bang Chul Jung
IEEE Internet of Things Journal | VOL. 10
Jeong Seon Yeom, et. al.Jeong Seon Yeom ... Bang Chul Jung
01 Jan 2023
IEEE Internet of Things Journal | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV Based Random Access IoT Networks With NOMA

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Communications