Adaptive Q-learning Research Articles

Scheduling efficient energy management system operations in response to unstable customer demand, electricity prices, and weather increases the complexity of control systems and requires a flexible and cost-effective control policy. This study develops an intelligent, real-time battery energy storage control based on a reinforcement learning model for grid-connected residential houses equipped with solar photovoltaic panels and a battery energy storage system. Because the performance of reinforcement learning depends heavily on the design of the underlying Markov decision process, a cyclic time-dependent Markov process is designed to capture the daily cyclic patterns in demand, electricity price, and solar energy. This Markov process is used in the Q-learning algorithm, resulting in more efficient battery energy control and lower electricity costs. The proposed Q-learning algorithm is compared with two benchmark models: a deterministic equivalent solution and a One-step Roll-out algorithm. Numerical experiments show that the gap between the deterministic equivalent solution and the Q-learning approach for one-month electricity cost decreased from 7.99% to 3.63% for house 27 and from 6.91% to 3.26% for house 387 when the discrete size of demand, solar energy, price, and battery energy level was adjusted to 20. The proposed Q-learning also performs better than the One-step Roll-out algorithm. Moreover, the effect of the discrete size of the state-space parameters on the adaptive Q-learning performance and computational time is investigated. Variations in the electricity price affect the Q-learning algorithm’s performance more than variations in the other parameters.
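
The idea of a cyclic time-dependent state can be illustrated with a small tabular Q-learning sketch. This is not the authors' model: it assumes a toy setting in which the state is only (hour of day, discretized battery level), the price, demand, and solar profiles are fixed 24-hour lists, and the reward is the negative grid cost. The names `step`, `train`, `LEVELS`, and `ACTIONS` are hypothetical and chosen for illustration only.

```python
import random

HOURS = 24            # cyclic time index: hour 23 wraps around to hour 0
LEVELS = 5            # assumed number of discrete battery levels (0..4)
ACTIONS = (-1, 0, 1)  # discharge, idle, or charge by one level

def step(hour, level, action, price, demand, solar):
    """Toy transition: returns (next_hour, next_level, reward)."""
    new_level = min(max(level + action, 0), LEVELS - 1)
    delta = new_level - level  # energy actually moved into the battery
    # Energy bought from the grid; surplus is simply discarded in this toy model.
    grid = max(demand[hour] - solar[hour] + delta, 0.0)
    cost = price[hour] * grid
    return (hour + 1) % HOURS, new_level, -cost

def train(price, demand, solar, episodes=2000,
          alpha=0.1, gamma=0.95, eps=0.1, seed=0):
    """Tabular Q-learning over (hour, battery level) states."""
    rng = random.Random(seed)
    q = {(h, b): [0.0] * len(ACTIONS)
         for h in range(HOURS) for b in range(LEVELS)}
    for _ in range(episodes):
        hour, level = 0, rng.randrange(LEVELS)
        for _ in range(HOURS):  # one episode covers one day
            if rng.random() < eps:  # epsilon-greedy exploration
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[(hour, level)][i])
            nh, nl, r = step(hour, level, ACTIONS[a], price, demand, solar)
            # Standard Q-learning update toward the bootstrapped target.
            target = r + gamma * max(q[(nh, nl)])
            q[(hour, level)][a] += alpha * (target - q[(hour, level)][a])
            hour, level = nh, nl
    return q
```

Because the hour index wraps with `(hour + 1) % HOURS`, the same Q-table entries are revisited every day, which is one simple way to exploit daily periodicity. A fuller state would also include discretized demand, solar energy, and price, as the abstract describes.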

Cloud-assisted internet of things (CIoT) is built on the wireless sensor network (WSN) architecture. A sensor network is an autonomous, resource-constrained collection of internet of things (IoT) sensor nodes. The nodes communicate in an ad hoc fashion to transfer cloud information over the virtual environment. Clustering in WSNs improves the quality of the network by controlling energy consumption and improving data gathering accuracy, which in turn improves the service rates of CIoT. Optimizing IoT sensor networks through energy and overhead management requires complex clustering algorithms; a simple clustering scheme cannot achieve the desired performance enhancement during transmission in a virtual environment. This research proposes a reinforcement learning technique, adaptive Q-learning (AQL), to improve network performance with a minimum energy–overhead tradeoff in a sensor network-aided CIoT. AQL operates in two distinct phases: cluster head selection and forwarder selection. A decision-making system qualifies nodes based on their past transmission behavior. AQL improves both inter- and intra-cluster communication through adaptive forwarder and cluster head selection conditions. The simulation results show the consistency of the proposed AQL: it retains the live node count in the network and the nodes' energy even as overhead in the sensor network is reduced. With these improvements in the sensor network, the performance of CIoT is considerably improved. The experimental results illustrate the effectiveness of the proposed learning technique, which improves network lifetime with a high request–response rate and minimizes delay, overhead, and request failures.
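
The two-phase selection described above can be sketched roughly as follows. The abstract does not give AQL's state, reward, or qualification rules, so everything here is an assumption for illustration: each node keeps a single learned score updated from past transmission outcomes, a node qualifies only if its remaining energy is at or above a threshold, and the reward is +1 for a delivered transmission and -1 otherwise. The names `update_q`, `select`, `run_round`, and `min_energy` are hypothetical.

```python
def update_q(q, node, reward, alpha=0.5):
    """Move the node's score toward the latest reward (stateless Q-style update)."""
    old = q.get(node, 0.0)
    q[node] = old + alpha * (reward - old)
    return q[node]

def select(q, candidates, energy, min_energy=0.2):
    """Pick the qualified candidate with the highest score, or None."""
    # Assumed qualification rule: enough remaining energy to take the role.
    alive = [n for n in candidates if energy[n] >= min_energy]
    if not alive:
        return None
    # Ties on score are broken by remaining energy.
    return max(alive, key=lambda n: (q.get(n, 0.0), energy[n]))

def run_round(q, cluster, energy, delivered):
    """One round: phase 1 picks a cluster head, phase 2 picks a forwarder."""
    head = select(q, cluster, energy)
    forwarder = select(q, [n for n in cluster if n != head], energy)
    for n in (head, forwarder):
        if n is not None:
            # Assumed reward: +1 if the node's transmission succeeded, else -1.
            update_q(q, n, 1.0 if delivered(n) else -1.0)
    return head, forwarder

if __name__ == "__main__":
    scores = {}
    energy = {"a": 0.9, "b": 0.5, "c": 0.1}
    print(run_round(scores, ["a", "b", "c"], energy, lambda n: n == "a"))
    print(scores)
```

In this sketch, nodes that repeatedly fail to deliver see their score fall and are passed over in later rounds, which is one plausible reading of qualifying nodes by their past behavior.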

Articles published on Adaptive Q-learning

An adaptive Q-learning based particle swarm optimization for multi-UAV path planning

Optimizing data aggregation and clustering in Internet of things networks using principal component analysis and Q-learning

Reinforcement learning-based hybrid differential evolution for global optimization of interplanetary trajectory design

Control with adaptive Q-learning: A comparison for two classical control problems

Battery energy storage control using a reinforcement learning approach with cyclic time-dependent Markov process

Online adaptive Q-learning method for fully cooperative linear quadratic dynamic games

Optimizing the network energy of cloud assisted internet of things by using the adaptive neural learning approach in wireless sensor networks
