Abstract
The goethite iron-removal process is an important procedure to remove the iron ions from the zinc hydrometallurgy. However, as a coherent system with complex reaction mechanism, associated uncertainties, and interconnected adjacent reactors, it is difficult for the process to accurately control the ion concentration. Because a large amount of historical data can be obtained during the process, an optimal control algorithm based on off-policy reinforcement learning is proposed in this paper to overcome these difficulties. According to the historical data, the weights of neural network are learned offline, and the optimal control strategy is solved online. Firstly, a bounded function is introduced to define the maximum effect of the coherent system on the subsystem cost function and to extend the cost function of the nominal system, so that the decentralized guaranteed cost control problem can be expressed as the optimal control problem of the nominal system. Then, an approximate iterative control algorithm based on actor-critic structure is proposed. The actor and critic neural networks are used to approximate control strategies and cost functions respectively. To achieve complete off-line, a new neural network is added to the actor-critic structure to approximate a part of the unknown system structure, and the three neural network parameters are optimized by the state transition algorithm. Finally, the strategy update and strategy iteration operations are performed alternately to learn optimal control strategies. The effectiveness and flexibility of the proposed off-policy optimal control method is validated by data from a real industrial goethite iron-removal process.
Highlights
Zinc is an important non-ferrous raw material, which plays an important role in various fields
When the given termination condition θj(i) − θj(i+1) ≤ ζ is satisfied or the number of iterations is greater than the given number of times, find a set of parameter vectors that minimizes the objective function as parameters for critic NN and actor NN; Step 7: According to the weights and basis functions of the neural network obtained by optimization and the current status of each subsystem collected by the system, the realtime optimization control strategy uj for each subsystem is solved according to equation (41)
This paper proposes an off-policy optimal control method based on reinforcement learning for the associated Iron-Removal system
Summary
Zinc is an important non-ferrous raw material, which plays an important role in various fields. N. Chen et al.: Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning leach solution from one reactor to the next. Chen et al.: Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning leach solution from one reactor to the It is a non-linear process involving a series of complex chemical reactions such as oxidation, hydrolysis, and neutralization. Han et al [6] and others transformed the dynamic optimization problem of the iron-removal process into a nonlinear mathematical programming problem and proposed a multi-objective optimization method based on the state transition algorithm and constrained non-dominated sorting to find the optimal solution of the oxygen concentration and zinc oxide addition. DESCRIPTION OF THE OPTIMAL CONTROL OF IRON-REMOVAL SYSTEM The object of goethite process is zinc sulphate solution obtained by direct leaching of zinc concentrate.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.