Abstract

This article develops a model-free adaptive optimal control policy for discrete-time Markov jump systems. First, a two-player zero-sum game is formulated to obtain an optimal control policy that minimizes a cost function against the worst-case disturbance. Second, an action- and mode-dependent value function is set up for the zero-sum game so that such a policy can be found with a convergence guarantee, rather than by solving an optimization problem subject to coupled algebraic Riccati equations. Specifically, motivated by the Bellman optimality principle, we develop an online value iteration algorithm for the zero-sum game that learns while controlling and requires no initial stabilizing policy. With this algorithm, disturbance attenuation for Markov jump systems is achieved without knowledge of the system matrices. Adaptivity to slowly changing uncertainties also follows from the model-free feature and the convergence of the policy. Finally, the effectiveness and practical potential of the algorithm are demonstrated on two numerical examples and a solar boiler system.
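
To illustrate the kind of update such an algorithm involves, consider the following generic sketch in standard zero-sum linear-quadratic game notation (the symbols A_i, B_i, D_i, Q_i, R_i, gamma, and p_{ij} are assumptions for illustration, not notation taken from the paper). For a Markov jump linear system \(x_{k+1} = A_{r_k} x_k + B_{r_k} u_k + D_{r_k} w_k\) with mode \(r_k \in \{1,\dots,N\}\) and stage cost \(x_k^{\top} Q_{r_k} x_k + u_k^{\top} R_{r_k} u_k - \gamma^2 w_k^{\top} w_k\), a mode-dependent value-iteration step on the game Q-function may be written as
\[
\mathcal{Q}^{m+1}_i(x_k,u_k,w_k) \;=\; x_k^{\top} Q_i x_k + u_k^{\top} R_i u_k - \gamma^2 w_k^{\top} w_k \;+\; \min_{u}\,\max_{w}\; \mathbb{E}\!\left[\mathcal{Q}^{m}_{r_{k+1}}(x_{k+1},u,w)\,\middle|\, r_k = i\right],
\]
initialized at \(\mathcal{Q}^{0}_i \equiv 0\), which is why no initial stabilizing policy is required; the right-hand side can be evaluated from measured data \((x_k, u_k, w_k, x_{k+1}, r_k, r_{k+1})\) alone, without knowledge of \(A_i\), \(B_i\), or \(D_i\).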
