Reinforcement Learning with Bounded Information Loss

Jan Peters,Katharina MüLling,Yasemin Altun,Yevgeny Seldin,Pierre BessiéRe,Jean-FrançOis Bercher,Ali Mohammad-Djafari

doi:10.1063/1.3573639

Abstract

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature convergence and implausible solutions. As first suggested in the context of covariant or natural policy gradients, many of these problems may be addressed by constraining the information loss. In this paper, we continue this path of reasoning and suggest two reinforcement learning methods, i.e., a model‐based and a model free algorithm that bound the loss in relative entropy while maximizing their return. The resulting methods differ significantly from previous policy gradient approaches and yields an exact update step. It works well on typical reinforcement learning benchmark problems as well as novel evaluations in robotics. We also show a Bayesian bound motivation of this new approach [8].

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reinforcement Learning with Bounded Information Loss

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Relative Entropy Policy Search
Jan Peters ... Yasemin Altun
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 24
Jan Peters, et. al.Jan Peters ... Yasemin Altun
05 Jul 2010
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 24

Natural Actor-Critic
Jan Peters ... Stefan Schaal
Neurocomputing | VOL. 71
Jan Peters, et. al.Jan Peters ... Stefan Schaal
01 Feb 2008
Neurocomputing | VOL. 71

Natural Actor-Critic
Jan Peters ... Stefan Schaal
-
Jan Peters, et. al.Jan Peters ... Stefan Schaal
01 Jan 2004
01 Jan 2004

Reinforcement Learning for Clinical Applications.
Kia Khezeli ... Benjamin Shickel
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18
Kia Khezeli, et. al.Kia Khezeli ... Benjamin Shickel
08 Feb 2023
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforcement Learning with Bounded Information Loss

Abstract

Talk to us

Similar Papers