Average cost Markov decision processes under the hypothesis of Doeblin

Masami Kurano

doi:10.1007/bf02283606

Average cost Markov decision processes under the hypothesis of Doeblin

Masami Kurano

https://doi.org/10.1007/bf02283606

Copy DOI

Journal: Annals of Operations Research	Publication Date: Dec 1, 1991
Citations: 13

Affiliation: Chiba University

#Average Cost Markov Decision Processes #Ergodic Classes + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

Average cost Markov decision processes (MDPs) with compact state and action spaces and bounded lower semicontinuous cost functions are considered. Kurano [7] has treated the general case in which several ergodic classes and a transient set are permitted for the Markov process induced by any randomized stationary policy under the hypothesis of Doeblin and showed the existence of a minimum pair of state and policy. This paper considers the same case as that discussed in Kurano [7] and proves some new results which give the existence theorem of an optimal stationary policy under some reasonable conditions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Annals of Operations Research

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.