Reinforcement Learning with Factored States and Actions

Brian Sallans, Geoffrey E. Hinton

doi:10.5555/1005332.1016794

Journal: Journal of Machine Learning Research
Publication Date: Dec 1, 2004
Citations: 147

Keywords: Markov chain Monte Carlo sampling, action spaces

Abstract

A novel method is presented for approximating the value function and selecting good actions in Markov decision processes with large state and action spaces. The method approximates state-action values as negative free energies in an undirected graphical model called a product of experts. The model parameters can be learned efficiently because values and derivatives can be computed efficiently for a product of experts. Good actions can be found even in large factored action spaces by using Markov chain Monte Carlo sampling. Simulation results show that the product of experts approximation can be used to solve large problems. In one simulation it is used to find actions in action spaces of size 2^40.
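The two ideas in the abstract, state-action values as negative free energies and action selection by Markov chain Monte Carlo, can be illustrated with a minimal sketch. It assumes the product of experts is a restricted Boltzmann machine with binary state, action, and hidden units; the abstract does not give these details. The names `q_value` and `sample_action`, the layout of `W`, `b`, and `c`, and the random parameters are hypothetical choices made for illustration, not the authors' implementation. Learning is omitted.

```python
import math
import random

def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

def hidden_inputs(W, c, v):
    # Total input to each hidden unit ("expert") given the visible vector v.
    # W[i][k] is the weight between visible unit i and hidden unit k.
    return [c[k] + sum(W[i][k] * v[i] for i in range(len(v)))
            for k in range(len(c))]

def q_value(W, b, c, state, action):
    # Assumed form: Q(s, a) = -F(s, a), where F is the RBM free energy
    # F(v) = -sum_i b_i v_i - sum_k softplus(c_k + sum_i W_ik v_i)
    # and v is the concatenation of state bits and action bits.
    v = list(state) + list(action)
    bias_term = sum(b[i] * v[i] for i in range(len(v)))
    free_energy = -bias_term - sum(softplus(x) for x in hidden_inputs(W, c, v))
    return -free_energy

def sample_action(W, b, c, state, n_action_bits, sweeps, rng):
    # Gibbs sampling with the state bits clamped: alternately sample the
    # hidden units given (state, action) and the action bits given the
    # hidden units. At equilibrium, actions are drawn with probability
    # proportional to exp(Q(s, a)), so high-value actions are favoured.
    n_state = len(state)
    action = [rng.randint(0, 1) for _ in range(n_action_bits)]
    for _ in range(sweeps):
        v = list(state) + action
        h = [1 if rng.random() < sigmoid(x) else 0
             for x in hidden_inputs(W, c, v)]
        action = [
            1 if rng.random() < sigmoid(
                b[n_state + j]
                + sum(W[n_state + j][k] * h[k] for k in range(len(h)))
            ) else 0
            for j in range(n_action_bits)
        ]
    return action

# Illustrative usage with small random parameters (not learned values).
rng = random.Random(0)
n_state, n_action, n_hidden = 3, 4, 5
W = [[rng.gauss(0.0, 0.5) for _ in range(n_hidden)]
     for _ in range(n_state + n_action)]
b = [rng.gauss(0.0, 0.5) for _ in range(n_state + n_action)]
c = [rng.gauss(0.0, 0.5) for _ in range(n_hidden)]

state = [1, 0, 1]
action = sample_action(W, b, c, state, n_action, sweeps=50, rng=rng)
print("sampled action:", action)
print("Q(state, action):", q_value(W, b, c, state, action))
```

Each Gibbs sweep costs time proportional to the number of weights rather than to the number of joint actions, which is why this kind of sampler remains usable when the action space is far too large to enumerate.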
