Model-based Optimization Research Articles

Purpose: Current reinforcement learning (RL) algorithms suffer from low learning efficiency and poor generalization, which significantly limit their practical application on real robots. This paper adopts a hybrid model-based and model-free policy search method with multi-timescale value function tuning, so that robots can learn complex motion planning skills in multi-goal, multi-constraint environments with only a few interactions.

Design/methodology/approach: A goal-conditioned model-based and model-free search method with multi-timescale value function tuning is proposed. First, the authors construct a multi-goal, multi-constraint policy optimization approach that fuses model-based policy optimization with goal-conditioned, model-free learning. Soft constraints on states and controls are applied to ensure fast and stable policy iteration. Second, an uncertainty-aware multi-timescale value function learning method is proposed, which constructs a multi-timescale value function network and adaptively chooses the planning timescale of the value function according to the value prediction uncertainty. This implicitly reduces the complexity of the value representation and improves the generalization performance of the policy.

Findings: The algorithm enables physical robots to learn generalized skills in real-world environments through a handful of trials. Simulation and experimental results show that the algorithm outperforms other relevant model-based and model-free RL algorithms.

Originality/value: This paper combines goal-conditioned RL and the model predictive path integral method into a unified model-based policy search framework, which improves the learning efficiency and policy optimality of motor skill learning in multi-goal, multi-constraint environments. An uncertainty-aware multi-timescale value function learning and selection method is proposed to overcome long-horizon problems, improve optimal policy resolution and thereby enhance the generalization ability of goal-conditioned RL.
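The abstract does not say how the planning timescale is chosen from the value prediction uncertainty. As a rough illustration only, the sketch below assumes each timescale is a discount factor with an ensemble of value estimates, measures uncertainty as the spread of that ensemble, and picks the longest timescale whose spread stays under a threshold. The function name `select_timescale`, the `max_std` threshold and the fallback rule are hypothetical and are not taken from the paper.

```python
from statistics import mean, pstdev


def select_timescale(predictions, max_std):
    """Pick a planning timescale from ensemble value predictions.

    predictions: dict mapping a discount factor (the assumed notion of
        "timescale") to a list of value estimates from an ensemble.
    max_std: hypothetical uncertainty threshold on the ensemble spread.

    Returns (gamma, mean_value) for the largest gamma whose ensemble
    standard deviation is at most max_std. If no timescale qualifies,
    falls back to the gamma with the smallest spread.
    """
    # Mean and population standard deviation of each ensemble.
    stats = {g: (mean(v), pstdev(v)) for g, v in predictions.items()}
    confident = [g for g, (_, spread) in stats.items() if spread <= max_std]
    if confident:
        gamma = max(confident)  # longest horizon that is still trusted
    else:
        gamma = min(stats, key=lambda g: stats[g][1])  # least uncertain
    return gamma, stats[gamma][0]


# Illustrative numbers only: the longest horizon is too uncertain here.
example = {
    0.9: [1.0, 1.1, 0.9],
    0.99: [5.0, 5.2, 4.8],
    0.999: [20.0, 30.0, 10.0],
}
chosen_gamma, chosen_value = select_timescale(example, max_std=0.5)
```

Preferring the longest trusted horizon is one plausible design choice among several; a weighted blend across timescales would be another.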

The electro-activated sulfite system (E-SO₃²⁻) is an emerging advanced oxidation technology, but its kinetics and process optimization have been overlooked. This study developed a first-principles kinetic model to investigate carbamazepine (CBZ) degradation and optimized the process through aeration and multiple dosages of SO₃²⁻. A low aeration rate (140 mL/min) remarkably accelerated sulfate radical (SO₄•⁻) generation by enhancing the rate at which the sulfite radical (SO₃•⁻) reacts with oxygen. As a result, the degradation efficiency of CBZ improved from 71% to 84% and the electrical energy per order (EE/O) decreased from 4.55 to 2.32 kWh/m³/order at a current density of 103 A/m² and an initial SO₃²⁻ dosage of 4 mM. However, further increasing the aeration rate diminished the enhancement of the SO₄•⁻ generation rate because SO₃•⁻ generation was limited, and excessive aeration decreased the degradation efficiency and increased the aeration cost because oxygen directly oxidized SO₃²⁻. Consequently, matching the SO₃²⁻ concentration, current density and aeration rate was crucial to maximizing the generation rate of SO₄•⁻. The optimal strategy used a current density of 75 A/m², an aeration rate of 100 mL/min and an SO₃²⁻ concentration of 1.1 mM with multiple dosages. Under these conditions, 95% of CBZ was removed in 20 min, 18% higher than with a single dosage. Finally, the optimal strategy for CBZ degradation was successfully applied in different water matrices and real wastewater, and was well predicted by the kinetic model. Furthermore, the degradation pathways of CBZ, including hydroxylation and deamidation, were elucidated using density functional theory and LC-MS detection. The toxicity of most intermediates decreased, while some intermediates were more toxic than CBZ. Overall, the combined strategy of aeration and multiple dosages of SO₃²⁻, proposed here for the first time, significantly facilitated the application of the E-SO₃²⁻ system in contaminant degradation.
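The abstract reports EE/O values without stating how they were computed. The sketch below uses the commonly cited batch definition, EE/O = E / (V · log₁₀(C₀/C)), which is assumed here and may differ in detail from the authors' calculation. The function name and the input numbers are illustrative, not values from the study.

```python
import math


def electrical_energy_per_order(energy_kwh, volume_m3, removal_fraction):
    """Electrical energy per order (kWh/m^3/order) for a batch run.

    energy_kwh: electrical energy consumed during the run, in kWh.
    volume_m3: treated volume, in cubic metres.
    removal_fraction: fraction of contaminant removed, strictly
        between 0 and 1 (for example 0.84 for 84% removal).
    """
    if not 0.0 < removal_fraction < 1.0:
        raise ValueError("removal_fraction must be strictly between 0 and 1")
    # Orders of magnitude removed: log10(C0 / C) = log10(1 / (1 - removal)).
    orders = math.log10(1.0 / (1.0 - removal_fraction))
    return energy_kwh / (volume_m3 * orders)


# Hypothetical inputs: 0.002 kWh spent on 1 L (0.001 m^3) at 90% removal,
# which is exactly one order of magnitude.
ee_o = electrical_energy_per_order(0.002, 0.001, 0.90)
```

Because the denominator grows with the logarithm of the removal ratio, a higher degradation efficiency at the same energy input lowers EE/O, which is consistent with the direction of the change reported above.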

Articles published on Model-based Optimization

A coordinated active and reactive power optimization approach for multi-microgrids connected to distribution networks with multi-actor-attention-critic deep reinforcement learning

Improvement in Natural Antioxidant Recovery from Sea Buckthorn Berries Using Predictive Model-Based Optimization

Designing Cell-Type-Specific Promoter Sequences Using Conservative Model-Based Optimization.

Pharmacokinetics and pharmacodynamics of intravenous delafloxacin in healthy subjects: model-based dose optimization.

A goal-conditioned policy search method with multi-timescale value function tuning

Management strategy of granular sludge settleability in saline denitrification: Insights from machine learning

Mathematical Model-Based Optimization of Continuous Flow Photobioreactor Operating at Steady State Using MATLAB Optimization Function

Model-based process optimization of black soldier fly egg production.

Efficient Expansion Algorithm of Urban Logistics Network for Medical Products Considering Environmental Impact

Parameter Tuning Approach for Incremental Nonlinear Dynamic Inversion-Based Flight Controllers

Multi-objective optimization and energy efficiency improvement for rotor duct in integrated energy recovery-pressure boost device

Hybrid Centralized Training and Decentralized Execution Reinforcement Learning in Multi-Agent Path-Finding Simulations

A bilevel fast-convergent optimizer via high-fidelity convex models: Application on optimal operation of all-parallel heterogeneous chiller-pump systems

A novel deep learning model-based optimization algorithm for text message spam detection

Surrogate Model-Based Filter Optimization by a Field-Circuit Model Mapping

Experimentally implemented dynamic optogenetic optimization of ATPase expression using knowledge-based and Gaussian-process-supported models

Model-based design optimization for motion decoupling in dual-segment flexible robots

Electromechanical model for electro-ribbon actuators

Combined Observer-Based State Feedback and Optimized P/PI Control for a Robust Operation of Quadrotors

Kinetic model-based optimization in electro-activated sulfite system for carbamazepine degradation: Aeration and multiple dosages of sulfite
