Abstract

Markov decision processes (MDPs) have become a standard for representing uncertainty in decision-theoretic planning. However, MDPs require an explicit representation of the state space and the probabilistic transition model, which are not always easy to define in continuous or hybrid continuous-discrete domains. Even when such a representation is available, the size of the state space and the number of state variables involved in the transition function may make the resulting MDP unsolvable with traditional techniques. In this paper we present a reward-based abstraction for solving hybrid MDPs. In the proposed method, we gather information about the rewards and the dynamics of the system by exploring the environment. This information is used to build a decision tree (C4.5) representing a small set of abstract states with equivalent rewards, and then to learn a probabilistic transition function with a Bayesian network learning algorithm (K2). The output of the system is a problem specification ready to be solved with traditional dynamic programming algorithms. We have tested our abstract MDP model approximation in real-world problem domains. We present results in terms of the models learned and their solutions for different configurations, showing that our approach produces fast solutions with satisfactory policies.
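
To illustrate the final stage of the pipeline, the following is a minimal sketch (not the authors' implementation) of solving the resulting abstract MDP with value iteration, a traditional dynamic programming algorithm. It assumes the learned transition model and per-abstract-state rewards have already been exported as NumPy arrays; the array layout and function name are illustrative only.

```python
import numpy as np

def value_iteration(P, R, gamma=0.95, tol=1e-6):
    """Solve a discrete (abstract) MDP by value iteration.

    P: learned transition model, shape (n_actions, n_states, n_states),
       where P[a, s, s'] is the probability of reaching abstract state s'
       from abstract state s under action a.
    R: reward vector, shape (n_states,), one reward per abstract state.
    Returns the optimal value function and a greedy policy.
    """
    n_actions, n_states, _ = P.shape
    V = np.zeros(n_states)
    while True:
        # Q[a, s] = R[s] + gamma * sum_{s'} P[a, s, s'] * V[s']
        Q = R[None, :] + gamma * (P @ V)
        V_new = Q.max(axis=0)
        if np.max(np.abs(V_new - V)) < tol:
            V = V_new
            break
        V = V_new
    policy = Q.argmax(axis=0)
    return V, policy
```

Because the abstraction reduces the problem to a small set of abstract states, a dense array representation like the one above is typically sufficient, which is what makes the fast solutions reported in the paper possible.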
