Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning

Ning Chen,Weihua Gui,Jiayang Dai,Shuhan Luo,Biao Luo

doi:10.1109/access.2020.3015801

Ning Chen, Weihua Gui + Show 3 more

Open Access

https://doi.org/10.1109/access.2020.3015801

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 8	License type: CC BY 4.0

Affiliation: Central South University

Abstract

The goethite iron-removal process is an important procedure to remove the iron ions from the zinc hydrometallurgy. However, as a coherent system with complex reaction mechanism, associated uncertainties, and interconnected adjacent reactors, it is difficult for the process to accurately control the ion concentration. Because a large amount of historical data can be obtained during the process, an optimal control algorithm based on off-policy reinforcement learning is proposed in this paper to overcome these difficulties. According to the historical data, the weights of neural network are learned offline, and the optimal control strategy is solved online. Firstly, a bounded function is introduced to define the maximum effect of the coherent system on the subsystem cost function and to extend the cost function of the nominal system, so that the decentralized guaranteed cost control problem can be expressed as the optimal control problem of the nominal system. Then, an approximate iterative control algorithm based on actor-critic structure is proposed. The actor and critic neural networks are used to approximate control strategies and cost functions respectively. To achieve complete off-line, a new neural network is added to the actor-critic structure to approximate a part of the unknown system structure, and the three neural network parameters are optimized by the state transition algorithm. Finally, the strategy update and strategy iteration operations are performed alternately to learn optimal control strategies. The effectiveness and flexibility of the proposed off-policy optimal control method is validated by data from a real industrial goethite iron-removal process.

Highlights

Zinc is an important non-ferrous raw material, which plays an important role in various fields
When the given termination condition θj(i) − θj(i+1) ≤ ζ is satisfied or the number of iterations is greater than the given number of times, find a set of parameter vectors that minimizes the objective function as parameters for critic NN and actor NN; Step 7: According to the weights and basis functions of the neural network obtained by optimization and the current status of each subsystem collected by the system, the realtime optimization control strategy uj for each subsystem is solved according to equation (41)
This paper proposes an off-policy optimal control method based on reinforcement learning for the associated Iron-Removal system

Summary

INTRODUCTION

Zinc is an important non-ferrous raw material, which plays an important role in various fields. N. Chen et al.: Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning leach solution from one reactor to the next. Chen et al.: Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning leach solution from one reactor to the It is a non-linear process involving a series of complex chemical reactions such as oxidation, hydrolysis, and neutralization. Han et al [6] and others transformed the dynamic optimization problem of the iron-removal process into a nonlinear mathematical programming problem and proposed a multi-objective optimization method based on the state transition algorithm and constrained non-dominated sorting to find the optimal solution of the oxygen concentration and zinc oxide addition. DESCRIPTION OF THE OPTIMAL CONTROL OF IRON-REMOVAL SYSTEM The object of goethite process is zinc sulphate solution obtained by direct leaching of zinc concentrate.

ACTOR-CRITIC NEURAL NETWORK AND ITS

SIMULATION

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Decentralized Adaptive Optimal Tracking Control for Massive Multi-agent Systems: An Actor-Critic-Mass Algorithm
Zejian Zhou ... Hao Xu
-
Zejian Zhou, et. al.Zejian Zhou ... Hao Xu
01 Dec 2019
01 Dec 2019

Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design
Biao Luo ... Derong Liu
Automatica | VOL. 50
Biao Luo, et. al.Biao Luo ... Derong Liu
25 Oct 2014
Automatica | VOL. 50

Decentralized Adaptive Optimal Tracking Control for Massive Multi-Agent Systems with Input Constraint
Zejian Zhou ... Hao Xu
-
Zejian Zhou, et. al.Zejian Zhou ... Hao Xu
01 Dec 2019
01 Dec 2019

Online approximate optimal control for affine non‐linear systems with unknown internal dynamics using adaptive dynamic programming
Xiong Yang ... Derong Liu
IET Control Theory & Applications | VOL. 8
Xiong Yang, et. al.Xiong Yang ... Derong Liu
01 Nov 2014
IET Control Theory & Applications | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access