Entropy-Regularized Partially Observed Markov Decision Processes

Timothy L Molloy,Girish N Nair

doi:10.1109/tac.2023.3264177

Entropy-Regularized Partially Observed Markov Decision Processes

Timothy L Molloy, Girish N Nair

Open Access

https://doi.org/10.1109/tac.2023.3264177

Copy DOI

Journal: IEEE Transactions on Automatic Control	Publication Date: Jan 1, 2024
Citations: 1

Affiliation: University of Melbourne

#Partially Observed Markov Decision Processes #Joint Entropy + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We investigate partially observed Markov decision processes (POMDPs) with cost functions regularized by entropy terms describing state, observation, and control uncertainty. Standard POMDP techniques are shown to offer bounded-error solutions to these entropy-regularized POMDPs, with exact solutions possible when the regularization involves the joint entropy of the state, observation, and control trajectories. Our joint-entropy result is particularly surprising since it constitutes a novel, tractable formulation of active state estimation.

Full Text