Considering the high-bandwidth and low-latency requirements of vehicular communication applications, we propose a novel dynamic reinforcement-learning-based slicing framework and optimization solutions for efficient resource provisioning in virtualized networks for device-to-device (D2D)-based vehicle-to-vehicle (V2V) communication. The aim is to balance resource utilization and quality-of-service (QoS) satisfaction levels across multiple slices. The slicing framework is designed as a three-stage layered framework. In the first stage, we propose a dynamic deep-reinforcement-learning-based virtual resource allocation scheme that allocates distinct resources to slices. In the second stage, we aggregate the D2D portion of each slice's resources for D2D-based V2V communication. In the third stage, because direct physical resource allocation incurs high computational complexity and signaling overhead, we transform the problem into a convex optimization problem and solve it with a distributed algorithm based on the alternating direction method of multipliers (ADMM). Performance results are provided in terms of resource utilization, QoS satisfaction, and throughput to show the benefit of integrating resource slices dedicated to supporting inter-slice D2D-based V2V communication in vehicular networks.
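To illustrate the distributed style of the third stage, the following is a minimal sketch of sharing-form ADMM applied to a toy convex allocation problem: each slice's allocation x_i tracks an illustrative per-slice demand d_i under a total-capacity coupling constraint. The quadratic cost, the demands, and the capacity value are assumptions for illustration only, not the paper's actual formulation; the point is that each x_i update is local, with only the average allocation exchanged between agents.

```python
import numpy as np

def admm_share(demand, capacity, rho=1.0, iters=500):
    """Sharing-problem ADMM (scaled dual form) for the toy problem:
        minimize  sum_i (x_i - d_i)^2   subject to  sum_i x_i = capacity.
    Each x_i update uses only local data plus the current average,
    so the iteration can run distributedly across slices."""
    d = np.asarray(demand, dtype=float)
    n = d.size
    x = np.zeros(n)
    u = 0.0                       # scaled dual variable for the coupling
    zbar = capacity / n           # the equality constraint fixes the average
    for _ in range(iters):
        v = x - x.mean() + zbar - u            # local linearization point
        x = (2.0 * d + rho * v) / (2.0 + rho)  # closed-form local x-update
        u = u + x.mean() - zbar                # dual ascent on the coupling
    return x
```

For example, with demands [1, 3] and capacity 2, the iterates converge to the analytic optimum [0, 2], i.e. each demand shifted equally to meet the capacity constraint.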