Deep Reinforcement Learning for Practical Phase-Shift Optimization in RIS-Aided MISO URLLC Systems

Ramin Hashemi,Samad Ali,Matti Latva-Aho,Nurul Huda Mahmood

doi:10.1109/jiot.2022.3232962

Ramin Hashemi, Samad Ali + Show 2 more

Open Access

https://doi.org/10.1109/jiot.2022.3232962

Copy DOI

Journal: IEEE Internet of Things Journal	Publication Date: May 15, 2023
Citations: 9	License type: CC BY 4.0

Affiliation: University of Oulu

Abstract

We study the joint active/passive beamforming and channel blocklength (CBL) allocation in a non-ideal reconfigurable intelligent surface (RIS)-aided ultra-reliable and low-latency communication (URLLC) system. The considered scenario is a finite blocklength (FBL) regime and the problem is solved by leveraging a deep reinforcement learning (DRL) algorithm named twin-delayed deep deterministic policy gradient (TD3). First, assuming an industrial automation system, the signal-to-interference-plus-noise ratio and achievable rate in the FBL regime are identified for each actuator. Next, the joint active/passive beamforming and CBL optimization problem is formulated where the objective is to maximize the total achievable FBL rate in all actuators, subject to non-linear amplitude response at the RIS elements, BS transmit power budget and total available CBL. Since the formulated problem is highly non-convex and non-linear, we resort to employing an actor-critic policy gradient DRL algorithm based on TD3. The considered method relies on interacting RIS with the industrial automation environment by taking actions which are the phase shifts at the RIS elements, CBL variables, and BS beamforming to maximize the expected observed reward, i.e., the total FBL rate. We assess the performance loss of the system when the RIS is non-ideal, i.e., with non-linear amplitude response, and compare it with ideal RIS without impairments. The numerical results show that optimizing the RIS phase shifts, BS beamforming, and CBL variables via the TD3 method with deterministic policy outperforms conventional methods and it is highly beneficial for improving the network total FBL rate considering finite CBL size.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Reinforcement Learning for Practical Phase-Shift Optimization in RIS-Aided MISO URLLC Systems

Abstract

Talk to us

Similar Papers

More From: IEEE Internet of Things Journal

Lead the way for us

Similar Papers

Deep Reinforcement Learning for Practical Phase Shift Optimization in RIS-assisted Networks over Short Packet Communications
Ramin Hashemi ... Nurul Huda Mahmood
-
Ramin Hashemi, et. al.Ramin Hashemi ... Nurul Huda Mahmood
07 Jun 2022
07 Jun 2022

Average Rate and Error Probability Analysis in Short Packet Communications Over RIS-Aided URLLC Systems
Ramin Hashemi ... Matti Latva-Aho
IEEE Transactions on Vehicular Technology | VOL. 70
Ramin Hashemi, et. al.Ramin Hashemi ... Matti Latva-Aho
01 Oct 2021
IEEE Transactions on Vehicular Technology | VOL. 70

Average Rate Analysis of RIS-aided Short Packet Communication in URLLC Systems
Ramin Hashemi ... Samad Ali
-
Ramin Hashemi, et. al.Ramin Hashemi ... Samad Ali
01 Jun 2021
01 Jun 2021

Joint Resource Allocation and Phase Shift Optimization for RIS-Aided eMBB/URLLC Traffic Multiplexing
Mohammed Almekhlafi ... Chadi Assi
IEEE Transactions on Communications | VOL. 70
Mohammed Almekhlafi, et. al.Mohammed Almekhlafi ... Chadi Assi
01 Feb 2022
IEEE Transactions on Communications | VOL. 70

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Reinforcement Learning for Practical Phase-Shift Optimization in RIS-Aided MISO URLLC Systems

Abstract

Talk to us

Similar Papers

More From: IEEE Internet of Things Journal