Abstract

A reconfigurable intelligent surface (RIS)-assisted downlink non-orthogonal multiple access (NOMA) Internet-of-Things (IoT) network is proposed, where a quality-of-service (QoS)-based NOMA clustering scheme is conceived to effectively utilize the limited wireless resources among IoT devices. A throughput maximization problem is formulated by jointly optimizing the phase shifts of the RIS and the power allocation of the base station (BS) from the short-term and long-term perspectives. We aim to investigate and compare the performance of deep learning (DL) and deep reinforcement learning (DRL) algorithms for solving the formulated problems. In particular, the DL method utilizes model-agnostic-meta-learning (MAML) to enhance the generalization capability of the neural network and to accelerate the convergence rate. For the DRL method, the deep deterministic policy gradient (DDPG) algorithm is employed to incorporate continuous phase-shift variables. It shows that the DL method only focuses on the maximization of the instantaneous throughput, whereas the DRL method can coordinate the power consumption over different time slots to maximize the long-term throughput. Numerical results demonstrate that 1) the proposed QoS-based NOMA clustering scheme achieves higher IoT throughput than the conventional channel-based scheme; 2) the implementation of RISs induces approximately 5% to 25% throughput gain as the number of RIS elements increases from 8 to 64; 3) DL and DRL achieve a similar throughput performance for the short-term optimization, while DRL is superior for the long-term optimization, especially when the total transmit power is limited.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call