Representation Discovery for MDPs Using Bisimulation Metrics

Sherry Ruan, Gheorghe Comanici, Doina Precup, Prakash Panangaden

doi:10.1609/aaai.v29i1.9701

Abstract

We provide a novel, flexible, iterative refinement algorithm to automatically construct an approximate state space representation for Markov Decision Processes (MDPs). Our approach leverages bisimulation metrics, which have been used in prior work to generate features to represent the state space of MDPs. We address a drawback of this approach, which is the expensive computation of the bisimulation metrics. We propose an algorithm to generate an iteratively improving sequence of state space partitions. Partial metric computations guide the representation search and provide much lower space and computational complexity, while maintaining strong convergence properties. We provide theoretical results guaranteeing convergence as well as experimental illustrations of the accuracy and savings (in time and memory usage) of the new algorithm, compared to traditional bisimulation metric computation.
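
To make the idea of "an iteratively improving sequence of state space partitions" concrete, here is a minimal sketch. It is not the paper's algorithm: the paper guides refinement with partial bisimulation metric computations, whereas this sketch uses the simpler, classical exact-bisimulation partition refinement (split a block whenever two of its states differ in reward or in the probability mass they send to some current block). The function names, the dictionary-based MDP encoding, and the toy MDP are all assumptions made for illustration.

```python
from collections import defaultdict


def refine(partition, rewards, transitions, ndigits=9):
    """One refinement step: split every block so that states kept together
    agree, for each action, on the reward and on the probability mass sent
    to each block of the current partition."""
    block_of = {s: i for i, block in enumerate(partition) for s in block}
    groups = defaultdict(list)
    for s in sorted(block_of):
        signature = []
        for a in sorted(rewards[s]):
            # Aggregate transition probabilities per destination block.
            mass = defaultdict(float)
            for t, p in transitions[s][a].items():
                mass[block_of[t]] += p
            mass_sig = tuple(sorted(
                (b, round(m, ndigits))
                for b, m in mass.items()
                if round(m, ndigits) > 0
            ))
            signature.append((a, round(rewards[s][a], ndigits), mass_sig))
        # States stay together only if they share a block and a signature.
        groups[(block_of[s], tuple(signature))].append(s)
    return sorted(groups.values())


def bisimulation_partition(rewards, transitions):
    """Refine the trivial one-block partition until nothing splits."""
    partition = [sorted(rewards)]
    while True:
        new_partition = refine(partition, rewards, transitions)
        # Refinement only ever splits blocks, so an unchanged block count
        # means the partition itself is unchanged.
        if len(new_partition) == len(partition):
            return new_partition
        partition = new_partition


# Hypothetical toy MDP with a single action "a".
# rewards[s][a] is the immediate reward; transitions[s][a] maps next state
# to probability.
rewards = {0: {"a": 0.0}, 1: {"a": 0.0}, 2: {"a": 1.0},
           3: {"a": 1.0}, 4: {"a": 0.0}}
transitions = {
    0: {"a": {2: 1.0}},
    1: {"a": {2: 0.5, 3: 0.5}},
    2: {"a": {2: 1.0}},
    3: {"a": {3: 1.0}},
    4: {"a": {4: 1.0}},
}

print(bisimulation_partition(rewards, transitions))  # [[0, 1], [2, 3], [4]]
```

In this toy example the first pass separates states by reward, and the second pass separates state 4 from states 0 and 1 because it never reaches the rewarding block. A metric-guided variant, as described in the abstract, would replace the exact signature comparison with approximate distances between states.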

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Mar 4, 2015
Citations: 6

Similar Papers

Representation Discovery for MDPs Using Bisimulation Metrics
Sherry Ruan ... Prakash Panangaden
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 29
04 Mar 2015

Scalable Methods for Computing State Similarity in Deterministic Markov Decision Processes
Pablo Samuel Castro
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
03 Apr 2020

Automatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics
Pablo Samuel Castro ... Doina Precup
01 Jan 2012

Value Function Transfer for Deep Multi-Agent Reinforcement Learning Based on N-Step Returns
Yong Liu ... Changjie Fan
01 Aug 2019
