Acceleration of reinforcement learning by policy evaluation using nonstationary iterative method.

Kei Senda,Takehisa Kohda,Suguru Hattori,Toru Hishinuma

doi:10.1109/tcyb.2014.2313655

Acceleration of reinforcement learning by policy evaluation using nonstationary iterative method.

Kei Senda, Takehisa Kohda + Show 2 more

Open Access

https://doi.org/10.1109/tcyb.2014.2313655

Copy DOI

Journal: IEEE Transactions on Cybernetics	Publication Date: Apr 10, 2014
Citations: 25	License type: other-oa

Affiliation: Kyoto University

#Nonstationary Iterative Method #Krylov Subspace Method + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Typical methods for solving reinforcement learning problems iterate two steps, policy evaluation and policy improvement. This paper proposes algorithms for the policy evaluation to improve learning efficiency. The proposed algorithms are based on the Krylov Subspace Method (KSM), which is a nonstationary iterative method. The algorithms based on KSM are tens to hundreds times more efficient than existing algorithms based on the stationary iterative methods. Algorithms based on KSM are far more efficient than they have been generally expected. This paper clarifies what makes algorithms based on KSM makes more efficient with numerical examples and theoretical discussions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

More From: IEEE Transactions on Cybernetics

Paper Title

Journal

Date

Author

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.