Deep Reinforcement Learning Algorithms for Machine-to-Machine Communications: A Review

Devarani Devi Ningombam

doi:10.1109/icccnt54827.2022.9984457

Devarani Devi Ningombam

https://doi.org/10.1109/icccnt54827.2022.9984457

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Automated data transfer and measurement between multiple devices are accomplished through Machine- to-machine (M2M) communications, which rely on zero or minimal human intervention. M2M communication offers a plethora of benefits and opportunities, including the ability to handle a wide range of data and large volumes, the ability to learn on their own, and better decision making. In spite of these advantages, M2M faces major challenges such as communication delay, data acquisition mismatching, the requirement of additional resources, and is highly susceptible to errors. To handle these challenges, in this work, we discuss various state-of-the-art deep reinforcement learning (DRL) algorithms. Deep Q-learning (DQN), dueling DQN, multi-step DQN, actor-critic (AC), advantage AC, REINFORCE, trustregion policy optimization (TRPO), and proximal policy optimization (PPO) algorithms are investigated.

Full Text