Reinforcement Learning For Optimization Research Articles

In the electric power generation sector, striking a balance between maximum power production and acceptable emission limits is a challenging task that requires sophisticated techniques. With traditional methods, this is an extremely complex issue because of the large number of process variables involved. In this paper, a deep reinforcement learning optimization framework (DRLOF) is proposed to determine the optimal operating conditions for a commercial circulating fluidized bed (CFB) power plant, balancing performance against environmental issues. The DRLOF represented the CFB as an environment created from over 1.5 years of plant data with a 1 min sampling time. This environment interacted with an advantage actor-critic (A2C) agent in two architectures, named ‘separate-A2CN’ and ‘shared-A2CN’. The framework was optimized by maximizing electric power generation within the constraints of the plant’s capacity and environmental emission standards, taking into consideration the cost of operations. After training, the framework with the separate-A2CN architecture achieved a 1.97% increase in electricity generation and a 1.59% reduction in NOx emissions at 14.3 times lower computational cost. Furthermore, different test scenarios demonstrated the framework’s flexibility, adaptability and lower computational burden. The findings of this study are not limited to the CFB power plant but can be extended to other chemical processes and industries. This approach minimizes the need for costly experiments, online optimization challenges and associated customizations.

• Multi-objective optimization of CFB power plant with deep reinforcement learning.
• Objective formulation considered power, fuel, reagent and environmental standards.
• Two main reinforcement learning architectures were analysed for better CFB results.
• Improved performance of CFB by 1.97% power increase and 1.59% emission reduction.
• The framework’s generality, adaptability and computational efficiency were tested.
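The abstract above describes an objective that rewards power generation while accounting for fuel and reagent costs, plant capacity and emission limits. A minimal sketch of how such an objective could be written as a penalized reward is given below. It is not the authors' formulation: the `PlantState` fields, the prices, the limits and the penalty weight are all hypothetical values chosen for illustration. The second function is the standard one-step advantage estimate used by actor-critic methods such as A2C.

```python
from dataclasses import dataclass


@dataclass
class PlantState:
    """Hypothetical snapshot of plant outputs for one time step."""
    power_mw: float      # electric power generated
    fuel_cost: float     # cost of fuel consumed in this step
    reagent_cost: float  # cost of emission-control reagent in this step
    nox_ppm: float       # NOx concentration in the flue gas


def reward(state: PlantState,
           power_price: float = 1.0,    # assumed value of one MW
           nox_limit: float = 50.0,     # assumed emission standard
           capacity_mw: float = 100.0,  # assumed plant capacity
           penalty: float = 10.0) -> float:
    """Profit-like reward with linear penalties for constraint violations."""
    profit = power_price * state.power_mw - state.fuel_cost - state.reagent_cost
    nox_excess = max(0.0, state.nox_ppm - nox_limit)
    capacity_excess = max(0.0, state.power_mw - capacity_mw)
    return profit - penalty * (nox_excess + capacity_excess)


def one_step_advantage(r: float, value: float, next_value: float,
                       gamma: float = 0.99) -> float:
    """Advantage estimate A = r + gamma * V(s') - V(s)."""
    return r + gamma * next_value - value


if __name__ == "__main__":
    within_limits = PlantState(power_mw=80.0, fuel_cost=20.0,
                               reagent_cost=5.0, nox_ppm=40.0)
    over_limit = PlantState(power_mw=80.0, fuel_cost=20.0,
                            reagent_cost=5.0, nox_ppm=60.0)
    print(reward(within_limits))
    print(reward(over_limit))
```

With a penalty formulation like this, an agent is pushed toward operating points that satisfy the limits without the limits being hard-coded into the action space; the paper itself may handle constraints differently.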

Measuring experimentally, or predicting computationally, the post-translational regulation of the enzymes needed in a metabolic pathway is a difficult problem. Consequently, regulation is mostly known only for well-studied reactions of central metabolism in various model organisms. In this study, we use two approaches to predict enzyme regulation policies and investigate the hypothesis that regulation is driven by the need to maintain the solvent capacity in the cell. The first predictive method uses a statistical thermodynamics and metabolic control theory framework, while the second uses a hybrid optimization–reinforcement learning approach. Efficient regulation schemes were learned from experimental data that either agree with theoretical calculations or result in a higher cell fitness using maximum useful work as a metric. As previously hypothesized, regulation is herein shown to keep the concentrations of both immediate and downstream products at physiological levels. Model predictions provide the following two novel general principles: (1) the regulation itself causes the reactions to be much further from equilibrium, in contrast to the common assumption that highly non-equilibrium reactions are the targets for regulation; and (2) the minimal regulation needed to maintain metabolite levels at physiological concentrations maximizes the free energy dissipation rate instead of preserving a specific energy charge. The resulting energy dissipation rate is an emergent property of regulation which may be represented by a high value of the adenylate energy charge. In addition, the predictions demonstrate that the amount of regulation needed can be minimized if it is applied at the beginning or branch point of a pathway, in agreement with common notions. The approach is demonstrated for three pathways in the central metabolism of E. coli (gluconeogenesis, glycolysis-tricarboxylic acid (TCA) and pentose phosphate-TCA) that each require different regulation schemes. It is shown quantitatively that hexokinase, glucose 6-phosphate dehydrogenase and glyceraldehyde phosphate dehydrogenase, all branch points of pathways, play the largest roles in regulating central metabolism.
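The adenylate energy charge mentioned in the abstract above has a standard definition: ([ATP] + 0.5 [ADP]) / ([ATP] + [ADP] + [AMP]). A minimal sketch of the calculation follows. The function name and the example concentrations are illustrative only and are not taken from the article.

```python
def adenylate_energy_charge(atp: float, adp: float, amp: float) -> float:
    """Return ([ATP] + 0.5 * [ADP]) / ([ATP] + [ADP] + [AMP]).

    All three concentrations must be in the same units; the result is
    dimensionless and lies between 0 (all AMP) and 1 (all ATP).
    """
    total = atp + adp + amp
    if total <= 0:
        raise ValueError("total adenylate concentration must be positive")
    return (atp + 0.5 * adp) / total


if __name__ == "__main__":
    # Illustrative concentrations, not measured values.
    print(adenylate_energy_charge(atp=3.0, adp=1.0, amp=0.0))
```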

Articles published on Reinforcement Learning For Optimization

Observation Time Effects in Reinforcement Learning on Contracts for Difference

Reinforcement Learning Power Control Algorithm Based on Graph Signal Processing for Ultra-Dense Mobile Networks

Deep reinforcement learning optimization framework for a power generation plant considering performance and environmental issues

A reinforcement learning optimization for future smart cities using software defined networking

Constrained Q-Learning for Batch Process Optimization

Optimization of Drilling Cost Using Artificial Intelligence

Physics-informed reinforcement learning optimization of nuclear assembly design

Learning Automata-Based Multiagent Reinforcement Learning for Optimization of Cooperative Tasks.

On Entropy Regularized Path Integral Control for Trajectory Optimization.

Enzyme activities predicted by metabolite concentrations and solvent capacity in the cell.

WSN-Assisted UAV Trajectory Adjustment for Pesticide Drift Control

A hierarchical constrained reinforcement learning for optimization of bitumen recovery rate in a primary separation vessel

BOLeRo: Behavior optimization and learning for robots

Online optimal and adaptive integral tracking control for varying discrete‐time systems using reinforcement learning

A novel axle temperature forecasting method based on decomposition, reinforcement learning optimization and neural network

Optimization for Reinforcement Learning: From a single agent to cooperative agents

UCRLF: unified constrained reinforcement learning framework for phase-aware architectures for autonomous vehicle signaling and trajectory optimization

Distributed Reinforcement Learning Algorithm for Dynamic Economic Dispatch With Unknown Generation Cost Functions

Directed Exploration in Black-Box Optimization for Multi-Objective Reinforcement Learning

A Novel Medical Image Edge Detection Method Based on Reinforcement Learning and Ant Colony Optimization
