Planning and acting in partially observable stochastic domains

Leslie Pack Kaelbling,Michael L Littman,Anthony R Cassandra

doi:10.1016/s0004-3702(98)00023-x

Planning and acting in partially observable stochastic domains

Leslie Pack Kaelbling, Michael L Littman + Show 1 more

Open Access

https://doi.org/10.1016/s0004-3702(98)00023-x

Copy DOI

Journal: Artificial Intelligence	Publication Date: May 1, 1998
Citations: 3687	License type: elsevier-specific

Affiliation: Brown University, Duke University

#Theory Of Markov Decision Processes #Theory Of Processes + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In this paper, we bring techniques from operations research to bear on the problem of choosing optimal actions in partially observable stochastic domains. We begin by introducing the theory of Markov decision processes (mdps) and partially observable MDPs (pomdps). We then outline a novel algorithm for solving pomdps off line and show how, in some cases, a finite-memory controller can be extracted from the solution to a POMDP. We conclude with a discussion of how our approach relates to previous work, the complexity of finding exact solutions to pomdps, and of some possibilities for finding approximate solutions.

Full Text