Abstract

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a new Monte-Carlo Tree Search algorithm along with an agent-specific extension to the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a variety of stochastic and partially observable domains. We conclude by proposing a number of directions for future research.
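For concreteness, the sketch below shows a generic UCT-style Monte-Carlo Tree Search planning loop in Python. It is an illustration only, not the paper's specific MCTS variant: the DummyModel class, its copy/sample interface, the plan function, and the reward-only percepts are hypothetical placeholders standing in for the paper's agent-specific Context Tree Weighting environment model.

import math
import random

class Node:
    def __init__(self):
        self.children = {}  # action -> Node
        self.visits = 0
        self.value = 0.0    # sum of sampled returns backed up through this node

def uct_select(node, actions, c=math.sqrt(2)):
    # Pick the action maximizing the UCB1 score; untried actions score infinity.
    def score(a):
        child = node.children.get(a)
        if child is None or child.visits == 0:
            return float("inf")
        exploit = child.value / child.visits
        explore = c * math.sqrt(math.log(node.visits) / child.visits)
        return exploit + explore
    return max(actions, key=score)

def simulate(node, model, actions, depth):
    # One simulation: descend by UCB1, sample rewards from the model,
    # and back the sampled return up the visited path.
    if depth == 0:
        return 0.0
    a = uct_select(node, actions)
    child = node.children.setdefault(a, Node())
    reward = model.sample(a)  # hypothetical interface: advance model, return reward
    ret = reward + simulate(child, model, actions, depth - 1)
    child.visits += 1
    child.value += ret
    return ret

class DummyModel:
    # Placeholder environment model (assumption); a real agent would use a
    # learned predictive model such as the paper's CTW-based mixture.
    def copy(self):
        return DummyModel()
    def sample(self, action):
        return random.random()

def plan(model, actions, simulations=200, horizon=8):
    # Run many simulations from a fresh root, then return the action with the
    # highest empirical mean return.
    root = Node()
    for _ in range(simulations):
        root.visits += 1
        simulate(root, model.copy(), actions, horizon)  # copy so rollouts leave the model intact
    return max(root.children,
               key=lambda a: root.children[a].value / root.children[a].visits)

print(plan(DummyModel(), actions=[0, 1]))

In the paper's agent, the sampled model would be the learned Context Tree Weighting mixture, used to generate percepts and rewards during each simulation; the dummy model above merely keeps the sketch self-contained.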
