Dynamic Markov Decision Research Articles

The bang-bang relays of the multiple-boiler system (MBS) control, are characterized by complex limiter saturation functions and classified as fixed parameters. Their action signals cannot precisely control the nonlinear dynamic building heating demand over their entire range of operation. Moreover, in a mono-boiler system, the bang-bang controller endures increasing short cycling over partial load time due to the heating system being considered to have an oversized boiler at most times of running, thus promoting high energy consumption and fluctuating indoor thermal comfort. So, it is difficult to cope with uncertainties in outdoor environments and indoor heating load. Hence, this study formulates the MBS control problem as a dynamic Markov decision process and applies a deep clustering of reinforcement learning approach to obtain the optimal control policy through interaction with the environment based on multi-agent learning according to bang-bang action. With such an approach, adopting a new boiler sequencing control (BSC) strategy using deep clustering of reinforcement learning based on a bang-bang (DCRLBB) manner. The deep clustering is configured to break Lagrangian trajectory curves into piecewise segments to represent the RL agent's action policy. The agent's action policy signals are configured from the bang-bang reward formula based on trade-off implications to be more adjustable than traditional fixed parameters such as fuzzy bang-bang controller (FBBC). The agent of BSC significantly affects the energy performance of the MBS, whereas the other agent resizes boiler capacity by acting to adjust the boiler solenoid fuel valve. The comparison of results between the proposed strategy and conventional FBBC shows distinct differences in the superior response of DCRLBB under dynamic indoor/outdoor actual conditions and energy saving by more than 32% while maintaining the indoor thermal in the comfortable range.

Read full abstract

In Industrial Internet of Things (IIoT), a large volume of data is collected periodically by IoT devices, and timely data routing and processing are important requirements. Age of Information (AoI), which is a metric to evaluate the freshness of status information in data processing, has become one of the most important objectives in IIoT. In this paper, considering limited communication, computation and energy resources on IoT devices, we jointly study the optimal AoI-aware energy control and computation offloading problem within a dynamic IIoT scenario with multiple IoT devices and multiple edge servers. Based on extensive analysis of real-life IoT dataset, Markovian queueing models are constructed to capture the dynamics of IoT devices and edge servers, and their corresponding analyses are provided. With the quantitative analytical results, we formulate a dynamic Markov decision problem with the objective of minimizing the long-term energy consumption while satisfying AoI constraints for real-time data processing. To solve the problem, we apply Deep Reinforcement Learning (DRL) techniques for adapting to large-scale dynamic IIoT environments, and design an intelligent Energy Control and Computation Offloading (ECCO) algorithm. Extensive simulation experiments are conducted based on real-world dataset, and the comparison results illustrate the superiority of our ECCO algorithm over both existing DRL and non-DRL algorithms.

Read full abstract

Dynamic Markov Decision Research Articles

Articles published on Dynamic Markov Decision

Energy and comfort aware operation of multi-zone HVAC system through preference-inspired deep reinforcement learning

Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings

Smart Elderly Care: An Intelligent e-Procurement System for Elderly Supplier Selecting

AoI-aware energy control and computation offloading for industrial IoT

Stochastic resource scheduling via bilayer dynamic Markov decision process in mobile cloud networks

Assigning multiple job types to parallel specialized servers

A Framework for Dynamic Context-Centric Commander Decision Support

Dynamic Markov Decision Policies for Delay Constrained Wireless Scheduling

Modeling prawn production management system: A dynamic Markov decision approach

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Dynamic Markov Decision Research Articles

Articles published on Dynamic Markov Decision

Energy and comfort aware operation of multi-zone HVAC system through preference-inspired deep reinforcement learning

Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings

Smart Elderly Care: An Intelligent e-Procurement System for Elderly Supplier Selecting

AoI-aware energy control and computation offloading for industrial IoT

Stochastic resource scheduling via bilayer dynamic Markov decision process in mobile cloud networks

Assigning multiple job types to parallel specialized servers

A Framework for Dynamic Context-Centric Commander Decision Support

Dynamic Markov Decision Policies for Delay Constrained Wireless Scheduling

Modeling prawn production management system: A dynamic Markov decision approach