Optimal Control with Partially Observed Regime Switching: Discounted and Average Payoffs

Beatris Adriana Escobedo-Trujillo, Javier Garrido-Meléndez, J D Revuelta-Acosta, Gerardo Alcalá

doi:10.3390/math10122073

Open Access

Journal: Mathematics	Publication Date: Jun 15, 2022
Citations: 1	License type: CC BY 4.0

Affiliation: Universidad Veracruzana

Abstract

We consider an optimal control problem under the discounted and average payoff criteria. The reward rate (or cost rate) may be unbounded from above and below, and the state dynamics are governed by a stochastic differential equation with Markovian switching. The switching is driven by a hidden continuous-time Markov chain that can be observed only through Gaussian white noise. Our main aim is to give conditions for the existence of optimal Markov stationary controls. These conditions generalize those that ensure the existence of optimal control policies for completely observed optimal control problems. We use standard dynamic programming techniques together with hidden Markov model filtering to achieve this. As applications of our results, we study the discounted linear quadratic regulator (LQR) problem, the ergodic LQR problem for a modeled quarter-car suspension, the average LQR problem for a modeled quarter-car suspension with damping, and an explicit application to optimal pollution control.

Full Text