PAC Optimal Exploration in Continuous Space Markov Decision Processes

Jason Pazis,Ronald Parr

doi:10.1609/aaai.v27i1.8678

PAC Optimal Exploration in Continuous Space Markov Decision Processes

Jason Pazis, Ronald Parr

Open Access

https://doi.org/10.1609/aaai.v27i1.8678

Copy DOI

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 30, 2013
Citations: 48

Affiliation: Duke University

#Finite Sample Guarantees #Continuous Space Problems + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Current exploration algorithms can be classified in two broad categories: Heuristic, and PAC optimal. While numerous researchers have used heuristic approaches such as epsilon-greedy exploration successfully, such approaches lack formal, finite sample guarantees and may need a significant amount of fine-tuning to produce good results. PAC optimal exploration algorithms, on the other hand, offer strong theoretical guarantees but are inapplicable in domains of realistic size. The goal of this paper is to bridge the gap between theory and practice, by introducing C-PACE, an algorithm which offers strong theoretical guarantees and can be applied to interesting, continuous space problems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.