_This article, written by JPT Technology Editor Chris Carpenter, contains highlights of paper SPE 200271, “Dual Heuristic Dynamic Programming in the Oil and Gas Industry for Trajectory Tracking Control,” by Seaar Al-Dabooni, Basra Oil Company; Alaa Azeez Tawiq, Technical Institute of Basra; and Hussen Alshehab, Basra Oil Company. The paper has not been peer reviewed._

The complete paper presents an artificial-intelligence (AI) algorithm, dual heuristic dynamic programming (DHDP), that is used to solve optimization-control problems. Fast, self-learning control based on DHDP is illustrated for trajectory tracking of liquid levels on a quadruple-tank system (QTS) consisting of four tanks and two electrical pumps with two pressure-control valves. Two artificial neural networks are constructed for the DHDP approach: the critic network (which provides the evaluation, or critique, signal) and the actor network, or controller (which provides the control signal). The DHDP controller learns without human intervention.

Approximate Dynamic Programming (ADP)

Recently, many different types of AI algorithms have been applied in petroleum fields to solve optimization problems. The complete paper introduces ADP, a field of AI newly applied to oil and gas. ADP is a useful tool for handling the behavior of nonlinear systems and is a special class of reinforcement-learning (RL) algorithms. The authors write that ADP can be viewed as consisting of three categories: heuristic dynamic programming (HDP), DHDP, and globalized HDP. ADP features two neural networks, an actor and a critic, that provide the optimal control signal and the long-term cost value, respectively. ADP has numerous applications. The complete paper references work that discusses control of turbo-generator and swarm-robot problems by use of DHDP and that illustrates that action-dependent HDP can obtain an optimal path in multirobot navigation.

The QTS is used frequently in the oil and gas industry. DHDP is used to control the voltages of the two pumps so that the tank levels follow the desired set-point values, an approach that can learn by itself (a self-learning controller). The complete paper devotes several pages to equations and parameters that describe HDP.

In ADP, optimal control problems are solved, allowing agents to select an optimal action that minimizes a long-term cost value through solution of Bellman’s equation. RL and ADP are used to train the actor neural network to provide optimal actions based on minimizing the cost-to-go value produced by the critic network; a simplified sketch of this actor/critic update appears after the System Functionality section below. The actor neural network serves as a function approximator for the control policy. After full training of these networks, the optimal action values are obtained from the actor network.

System Functionality

The equipment receives the system states of the process through sensors, and the algorithm maximizes the reward by selecting the correct optimal action (control signal) to feed back to the equipment. The simulation results for applying DHDP with the QTS as a benchmark test problem were obtained using MATLAB. The QTS is used as an example in the paper because it is widely used, in whole or in part, in most petroleum exploration and production fields. Another reason for the authors’ choice of the QTS as a test problem is that it is difficult to control, with only a limited zone of operating parameters in which it remains stable. The multi-input/multioutput (MIMO) model of the QTS is similar to those of most MIMO devices in the oil and gas field; a generic form of this model is sketched below.
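The complete paper gives the QTS equations and parameter values; they are not reproduced in this synopsis. As a point of reference only, the sketch below shows a commonly used quadruple-tank formulation with illustrative numbers (not the authors’ values): two pump voltages drive the four tank levels, and the valve splits determine how much of each pump’s flow bypasses to the upper tanks.

```python
import numpy as np

# Illustrative parameters for a generic quadruple-tank model (not the paper's values):
# A - tank cross-sections (cm^2), a - outlet areas (cm^2),
# k - pump gains (cm^3/s per V), gamma - valve flow splits, g - gravity (cm/s^2)
A = np.array([28.0, 32.0, 28.0, 32.0])
a = np.array([0.071, 0.057, 0.071, 0.057])
k = np.array([3.33, 3.35])
gamma = np.array([0.70, 0.60])
g = 981.0

def qts_dynamics(h, v):
    """Rate of change of the four tank levels h (cm) given the two pump voltages v (V)."""
    h = np.maximum(h, 0.0)                      # levels cannot go negative
    q = a * np.sqrt(2.0 * g * h)                # gravity-driven outflow from each tank
    dh = np.empty(4)
    dh[0] = (-q[0] + q[2] + gamma[0] * k[0] * v[0]) / A[0]   # tank 1: fed by tank 3 and pump 1
    dh[1] = (-q[1] + q[3] + gamma[1] * k[1] * v[1]) / A[1]   # tank 2: fed by tank 4 and pump 2
    dh[2] = (-q[2] + (1.0 - gamma[1]) * k[1] * v[1]) / A[2]  # tank 3: fed by pump 2 bypass
    dh[3] = (-q[3] + (1.0 - gamma[0]) * k[0] * v[0]) / A[3]  # tank 4: fed by pump 1 bypass
    return dh

def step(h, v, dt=0.1):
    """One forward-Euler integration step of the tank levels."""
    return np.maximum(h + dt * qts_dynamics(h, v), 0.0)

h = np.array([12.0, 13.0, 5.0, 8.0])        # illustrative initial levels (cm)
h = step(h, v=np.array([3.0, 3.0]))          # one step with both pumps at 3 V
```

Everything here, including the parameter values and the simple integration step, is an assumption for illustration; the paper’s actual model and constants should be taken from the complete paper.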
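To make the actor/critic interaction described in the ADP section concrete, the following minimal sketch applies a DHDP-style update to a toy single-state linear plant rather than the four-state QTS, with linear gains standing in for the paper’s neural networks. In DHDP, the critic approximates the derivative of the cost-to-go with respect to the state, and the actor is adjusted to reduce the one-step utility plus the discounted cost-to-go. The plant model, utility weights, and learning rates below are assumptions for illustration only.

```python
import numpy as np

# Toy single-state plant x[k+1] = A*x[k] + B*u[k]; the paper uses the four-state QTS instead.
A_sys, B_sys = 0.95, 0.10
gamma, r = 0.95, 0.10          # discount factor and control-effort weight in the utility
lr_c, lr_a = 0.01, 0.01        # critic and actor learning rates

w_c = 0.0                      # critic weight: lambda(x) ~= w_c * x approximates dJ/dx
w_a = -0.1                     # actor weight:  u(x)      =  w_a * x

rng = np.random.default_rng(0)
for episode in range(200):
    x = rng.uniform(-1.0, 1.0)              # start each episode from a random state
    for k in range(50):
        u = w_a * x                         # actor picks the control signal
        x_next = A_sys * x + B_sys * u      # plant response
        lam_next = w_c * x_next             # critic's estimate of dJ/dx at the next state

        # DHDP critic target: derivative of [U(x,u) + gamma*J(x_next)] with respect to x,
        # propagated through both the plant and the actor (here U = 0.5*(x^2 + r*u^2)).
        lam_target = x + w_a * (r * u) + gamma * (A_sys + B_sys * w_a) * lam_next
        w_c -= lr_c * (w_c * x - lam_target) * x      # gradient step on the critic error

        # Actor update: descend dQ/du = dU/du + gamma * (dx_next/du) * lambda(x_next).
        dq_du = r * u + gamma * B_sys * lam_next
        w_a -= lr_a * dq_du * x                       # chain rule through u = w_a * x

        x = x_next

print(f"learned actor gain w_a = {w_a:.3f}, critic gain w_c = {w_c:.3f}")
```

In the paper’s setting, the same structure would be carried by two neural networks, with the QTS tracking errors forming the state and the two pump voltages forming the action.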
The overall learning-control-system performance was tested and compared with HDP and a well-known industrial controller, a proportional-integral-derivative (PID) controller, using MATLAB programming. The simulation results show that DHDP provides enhanced performance compared with the PID approach, with a 98.9002% improvement.
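The synopsis does not state how the 98.9002% figure is computed or which PID gains were used. As a hedged illustration only, the sketch below shows a textbook discrete PID control law and one plausible way a percentage improvement could be expressed, as a relative reduction in accumulated squared tracking error; the function names, gains, and metric are assumptions, not the paper’s definitions.

```python
import numpy as np

def pid_controller(kp, ki, kd, dt):
    """Return a stateful discrete PID control law u = f(error); gains are illustrative."""
    state = {"integral": 0.0, "prev_err": 0.0}
    def control(err):
        state["integral"] += err * dt
        deriv = (err - state["prev_err"]) / dt
        state["prev_err"] = err
        return kp * err + ki * state["integral"] + kd * deriv
    return control

def percent_improvement(err_dhdp, err_pid):
    """Relative reduction in accumulated squared tracking error (one plausible metric)."""
    ise_dhdp = np.sum(np.square(err_dhdp))
    ise_pid = np.sum(np.square(err_pid))
    return 100.0 * (1.0 - ise_dhdp / ise_pid)

pid = pid_controller(kp=3.0, ki=0.1, kd=0.5, dt=0.1)   # illustrative gains only
u = pid(1.2)                                            # control move for a 1.2-cm level error

# Example with synthetic tracking-error records (stand-ins for simulation output):
err_pid = np.array([2.0, 1.5, 1.0, 0.6, 0.3])
err_dhdp = np.array([2.0, 0.8, 0.2, 0.05, 0.01])
print(f"improvement: {percent_improvement(err_dhdp, err_pid):.1f}%")
```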