Finding Optimal Observation-Based Policies for Constrained POMDPs Under the Expected Average Reward Criterion

Xiaofeng Jiang,Xiaodong Wang,Hongsheng Xi,Falin Liu

doi:10.1109/tac.2015.2497904

Finding Optimal Observation-Based Policies for Constrained POMDPs Under the Expected Average Reward Criterion

Xiaofeng Jiang, Xiaodong Wang + Show 2 more

https://doi.org/10.1109/tac.2015.2497904

Copy DOI

Journal: IEEE Transactions on Automatic Control	Publication Date: Oct 1, 2016
Citations: 17

Affiliation: University of Science and Technology of China, Columbia University

#Average Reward Criterion #Imperfect State Information + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

In this technical note, constrained partially observable Markov decision processes with discrete state and action spaces under the average reward criterion are studied from a sensitivity point of view. By analyzing the derivatives of performance criteria, we develop a simulation-based optimization algorithm to find the optimal observation-based policy on the basis of a single sample path. This algorithm does not need any overly strict assumption and can be applied to the general ergodic Markov systems with the imperfect state information. The performance is proved to converge to the optimum with probability 1. One numerical example is provided to illustrate the applicability of the algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: IEEE Transactions on Automatic Control

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.