Abstract

Reinforcement learning (RL) is promising for online controller optimization, but its practical application has been hindered by safety concerns. This paper proposes an algorithm named Incremental Q-learning (IQ) and applies it to the online optimization of motor speed synchronization control. IQ ensures safe learning by adopting incremental action variables, which represent incremental changes rather than absolute magnitudes, and by dividing the one-round learning process of classic Q-learning (referred to here as Absolute Q-learning, AQ) into multiple consecutive rounds, with the Q table reset at the beginning of each round. Because the permitted change per action is restricted to a small interval, the agent learns its way safely, steadily, and robustly toward the optimal policy. Simulation results show that IQ outperforms AQ in optimality, safety, and adaptability: it converges to better final performance, exhibits significantly smaller performance variance throughout the learning process, produces smaller torque-trajectory deviations between consecutive episodes, and adapts to unknown disturbances faster. The algorithm holds great potential for online controller optimization and tuning in practical engineering projects. Source code and demos are provided.
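To make the two ideas concrete (incremental actions and per-round Q-table resets), below is a minimal, self-contained sketch of tabular Q-learning with incremental actions on a toy one-dimensional gain-tuning task. Everything here is an illustrative assumption rather than the paper's actual implementation or released source code: the toy environment, TARGET, N_STATES, the increment set DELTAS, and all hyperparameters are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setting (assumed, not from the paper): the agent tunes a scalar
# control gain toward an unknown target; reward is negative tracking error.
TARGET = 0.37                     # hypothetical optimal gain
N_STATES = 50                     # discretized gain values in [0, 1]
DELTAS = np.array([-1, 0, +1])    # incremental actions: at most one grid step,
                                  # never an absolute gain command
ALPHA, GAMMA, EPS = 0.1, 0.95, 0.1

def env_step(s, a):
    """Apply a small increment to the discretized gain and score the result."""
    s2 = int(np.clip(s + DELTAS[a], 0, N_STATES - 1))
    gain = s2 / (N_STATES - 1)
    return s2, -abs(gain - TARGET)   # closer to target -> higher reward

def run_round(s, episodes=300, steps=30):
    """One learning round; the Q table is reset when the round begins."""
    Q = np.zeros((N_STATES, len(DELTAS)))   # fresh Q table per round
    for _ in range(episodes):
        for _ in range(steps):
            # epsilon-greedy over *incremental* actions only
            if rng.random() < EPS:
                a = int(rng.integers(len(DELTAS)))
            else:
                a = int(np.argmax(Q[s]))
            s2, r = env_step(s, a)
            # standard one-step Q-learning update
            Q[s, a] += ALPHA * (r + GAMMA * Q[s2].max() - Q[s, a])
            s = s2
    return Q, s

# Chain several rounds: each round starts from the gain the previous round
# ended at, so the controller never jumps, only drifts in small steps.
s = N_STATES // 2
for k in range(3):
    Q, s = run_round(s)
    print(f"round {k}: gain ~ {s / (N_STATES - 1):.2f}")
```

In this sketch, restricting the action set to small increments bounds how far the control variable, and hence the torque trajectory, can move between consecutive episodes, which is the safety mechanism the abstract describes; the per-round reset lets each round re-learn around the current operating point.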
