Direct Heuristic Dynamic Programming Research Articles

Recently, artificial intelligence has been widely applied to optimization-control problems in petroleum exploration and production. This paper presents a state-of-the-art reinforcement-learning algorithm for petroleum optimization-control problems, called direct heuristic dynamic programming (DHDP). DHDP has two interacting artificial neural networks: the critic network, which provides a critique (evaluation) signal, and the actor network, which provides a control signal. This paper focuses on a generic online learning control system based on Markov decision process principles. Furthermore, DHDP is a model-free learning design that does not require prior knowledge of a dynamic model; therefore, DHDP can be applied directly to any petroleum equipment or device without the need to derive a mathematical model. Moreover, DHDP learns by itself (self-learning), without human intervention, through repeated interaction between the equipment and the environment/process. The equipment receives the states of the environment/process via sensors, and the algorithm maximizes the reward by selecting the optimal action (control signal). A quadruple tank system (QTS), whose nonlinear model responds closely to the real system, is taken as a benchmark test problem for three reasons. First, QTS, which consists of four tanks and two electrical pumps with two pressure control valves, is widely used (as an entire system or in parts) in most petroleum exploration/production fields. Second, QTS is difficult to control because it is stable only within a limited zone of operating parameters; therefore, if DHDP can control QTS by itself, it can control other equipment in a fast and optimal manner. Third, QTS is a multi-input multi-output (MIMO) model for the analysis of real-time nonlinear dynamic systems; therefore, the QTS model is similar to most MIMO devices in oil and gas fields. The overall learning control system performance is tested and compared with a proportional-integral-derivative (PID) controller via MATLAB programming. DHDP provides enhanced performance compared with the PID approach, with a 99.2466% improvement.
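The actor-critic interaction described in this abstract can be illustrated with a minimal sketch. The abstract does not give the update equations, so the error signals below follow the commonly cited direct HDP formulation (critic error e_c = alpha*J(t) - [J(t-1) - r(t)], actor error e_a = J(t) - U_c) and should be read as an assumption, not as the paper's implementation. The function names, network sizes, learning rates, and the values of `ALPHA` and `U_C` are all hypothetical choices made for illustration.

```python
import numpy as np

ALPHA = 0.95  # discount factor (assumed value)
U_C = 0.0     # desired ultimate objective (assumed: 0 when rewards are <= 0)

def init(n_x, n_u, n_h, rng):
    """Random weights for a one-hidden-layer actor and critic (hypothetical sizes)."""
    return {
        "Wa1": rng.uniform(-1, 1, (n_h, n_x)),        # actor: state -> hidden
        "Wa2": rng.uniform(-1, 1, (n_u, n_h)),        # actor: hidden -> control
        "Wc1": rng.uniform(-1, 1, (n_h, n_x + n_u)),  # critic: (state, control) -> hidden
        "wc2": rng.uniform(-1, 1, n_h),               # critic: hidden -> J
    }

def actor(p, x):
    """Control signal u in (-1, 1) computed from the state x."""
    h = np.tanh(p["Wa1"] @ x)
    return np.tanh(p["Wa2"] @ h), h

def critic(p, x, u):
    """Scalar cost-to-go estimate J computed from state and control."""
    z = np.concatenate([x, u])
    h = np.tanh(p["Wc1"] @ z)
    return float(p["wc2"] @ h), h, z

def dhdp_step(p, x, r, J_prev, lr_c=0.05, lr_a=0.05):
    """One online update of both networks from a single observed transition."""
    u, ha = actor(p, x)
    J, hc, z = critic(p, x, u)

    # Critic: gradient of 0.5 * e_c**2 with respect to the critic weights.
    e_c = ALPHA * J - (J_prev - r)
    back_c = p["wc2"] * (1.0 - hc**2)  # dJ / d(critic hidden pre-activation)
    grad_wc2 = e_c * ALPHA * hc
    grad_Wc1 = e_c * ALPHA * np.outer(back_c, z)

    # Actor: gradient of 0.5 * e_a**2, backpropagated through the critic to u.
    e_a = J - U_C
    dJ_du = (p["Wc1"].T @ back_c)[len(x):]
    delta_v = e_a * dJ_du * (1.0 - u**2)
    grad_Wa2 = np.outer(delta_v, ha)
    grad_Wa1 = np.outer((p["Wa2"].T @ delta_v) * (1.0 - ha**2), x)

    # All gradients were computed from the old weights; apply them together.
    p["wc2"] -= lr_c * grad_wc2
    p["Wc1"] -= lr_c * grad_Wc1
    p["Wa2"] -= lr_a * grad_Wa2
    p["Wa1"] -= lr_a * grad_Wa1
    return u, J, e_c, e_a
```

In an online loop, `dhdp_step` would be called once per sampling instant with the measured state `x` and reinforcement signal `r`, and the returned `J` would be passed back in as `J_prev` on the next call. No plant model appears anywhere in the update, which is the sense in which the scheme is model-free.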

Robotic prostheses deliver greater function than passive prostheses, but we face the challenge of tuning a large number of control parameters in order to personalize the device for individual amputee users. This problem is not easily solved by traditional control designs or the latest robotic technology. Reinforcement learning (RL) is naturally appealing. The recent, unprecedented success of AlphaZero demonstrated RL as a feasible, large-scale problem solver. However, the prosthesis-tuning problem is associated with several unaddressed issues such as that it does not have a known and stable model, the continuous states and controls of the problem may result in a curse of dimensionality, and the human-prosthesis system is constantly subject to measurement noise, environmental change and human-body-caused variations. In this paper, we demonstrated the feasibility of direct heuristic dynamic programming, an approximate dynamic programming (ADP) approach, to automatically tune the 12 robotic knee prosthesis parameters to meet individual human users' needs. We tested the ADP-tuner on two subjects (one able-bodied subject and one amputee subject) walking at a fixed speed on a treadmill. The ADP-tuner learned to reach target gait kinematics in an average of 300 gait cycles or 10 min of walking. We observed improved ADP tuning performance when we transferred a previously learned ADP controller to a new learning session with the same subject. To the best of our knowledge, our approach to personalize robotic prostheses is the first implementation of online ADP learning control to a clinical problem involving human subjects.

Articles published on Direct Heuristic Dynamic Programming

Online Reinforcement Learning Control by Direct Heuristic Dynamic Programming: From Time-Driven to Event-Driven.

Toward reliable designs of data-driven reinforcement learning tracking control for Euler–Lagrange systems

Virtual inertia control parameter regulator of doubly fed induction generator based on direct heuristic dynamic programming

Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning

Self-Learning Controllers in the Oil and Gas Industry

Optimal Adaptive Super-Twisting Sliding-Mode Control Using Online Actor-Critic Neural Networks for Permanent-Magnet Synchronous Motor Drives

Online Reinforcement Learning Control for the Personalization of a Robotic Knee Prosthesis.

A New Powered Lower Limb Prosthesis Control Framework Based on Adaptive Dynamic Programming

Adaptive fuzzy optimal control using direct heuristic dynamic programming for chaotic discrete-time system

Direct heuristic dynamic programming based on an improved PID neural network

A boundedness result for the direct heuristic dynamic programming

Direct Heuristic Dynamic Programming for Nonlinear Tracking Control With Filtered Tracking Error

Performance Evaluation of Direct Heuristic Dynamic Programming using Control-Theoretic Measures

Direct Heuristic Dynamic Programming for Damping Oscillations in a Large Power System

Supplementary Damping Controller Design using Direct Heuristic Dynamic Programming in Complex Power Systems
