Age of correlated information-optimal dynamic policy scheduling for sustainable Green IoT devices: A multi-agent deep reinforcement learning approach

Sandip Roy,Abhishek Bisht,Ashok Kumar Das,Sachin Shetty,M Shamim Hossain

doi:10.1016/j.iot.2024.101141

Abstract

The rapid progress in communication technologies and the widespread adoption of connected devices have given rise to various information-centric Internet-of-Things (IoT) systems, typically necessitating timely updates of information. Green IoT (G-IoT) strives to enhance the environment by reducing the power consumption of billions of devices engaged in extensive data exchange, addressing the substantial energy demand in the process. The Age of Correlated Information (AoCI) measures the freshness of information shared among two or more devices that contribute to the same decision-making process. Optimizing AoCI using Deep Reinforcement Learning (DRL) in IoT reduces energy consumption, optimizes resource utilization, and promotes environmentally conscious communication, contributing to the development of a sustainable G-IoT system. This paper focuses on scheduling the transmission of status update packets among interconnected G-IoT devices to minimize the application-specific long-term average AoCI. The problem is modeled as an NP-hard episodic Markov Decision Process (MDP), highlighting its computational complexity. To handle correlations and the curse of dimensionality, a multi-agent deep reinforcement learning algorithm, specifically the Multi-Agent Deep Deterministic Policy Grading (MADDPG) algorithm, is developed. The training progress displays episode rewards, with an environment designed to penalize multiple agents transmitting simultaneously or none at all, promoting cooperative behavior and minimizing the average age of correlated information. We provide comprehensive simulation results, including reward convergence, the learning process of actors and critics, and the resulting average AoCI with the number of episodes, demonstrating the effectiveness of the MADDPG algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Age of correlated information-optimal dynamic policy scheduling for sustainable Green IoT devices: A multi-agent deep reinforcement learning approach

Abstract

Talk to us

Similar Papers

More From: Internet of Things

Lead the way for us

Journal: Internet of Things	Publication Date: Mar 4, 2024
Citations: 2

Similar Papers

Power Allocation and Energy Cooperation for UAV-Enabled MmWave Networks: A Multi-Agent Deep Reinforcement Learning Approach.
Mari Carmen Domingo
Sensors | VOL. 22
Mari Carmen DomingoMari Carmen Domingo
30 Dec 2021
Sensors | VOL. 22

Independent Learning Approaches: Overcoming Multi-Agent Learning Pathologies In Team-Games

-

06 Mar 2020
06 Mar 2020

A Delay-Optimal Task Scheduling Strategy for Vehicle Edge Computing Based on the Multi-Agent Deep Reinforcement Learning Approach
Xuefang Nie ... Yunhui Yan
Electronics | VOL. 12
Xuefang Nie, et. al.Xuefang Nie ... Yunhui Yan
31 Mar 2023
Electronics | VOL. 12

A Joint Service Migration and Mobility Optimization Approach for Vehicular Edge Computing
Quan Yuan ... Jinglin Li
IEEE Transactions on Vehicular Technology | VOL. 69
Quan Yuan, et. al.Quan Yuan ... Jinglin Li
01 Aug 2020
IEEE Transactions on Vehicular Technology | VOL. 69

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Age of correlated information-optimal dynamic policy scheduling for sustainable Green IoT devices: A multi-agent deep reinforcement learning approach

Abstract

Talk to us

Similar Papers

More From: Internet of Things