Abstract

Recently, a direct-comparison approach has been developed to control and optimize the performance of a stochastic Markov system, see [1] for discrete time case, and [2] for continuous time case. Compared with dynamic programming, the standard approach for stochastic control problem, this alternative approach is simple and intuitive. It is based on the direct comparison of the system performance under two policies, and discounting is not needed when dealing with long-run average criterion. By directly applying this approach, we studied the continuous time Linear Quadratic Gaussian (LQG) control problem, obtained the optimal policy for the long run average criterion without introducing discounting. The well known algebraic Riccati equation for the optimal policy can be easily obtained by this direct-comparison approach. This paper servers as an example to show the effectiveness of direct-comparison approach for the continuous time case.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call