Action Redundancy in Reinforcement Learning

Nir Baram, Shie Mannor, Guy Tennenholtz

doi:10.48448/np6e-zp80

Abstract

Maximum Entropy (MaxEnt) reinforcement learning is a powerful learning paradigm that seeks to maximize return under entropy regularization. However, action entropy does not necessarily coincide with state entropy, e.g., when multiple actions produce the same transition. Instead, we propose to maximize the transition entropy, i.e., the entropy of next states. We show that transition entropy can be described by two terms: model-dependent transition entropy and action redundancy. In particular, we explore the latter in both deterministic and stochastic settings and develop tractable approximation methods in a near model-free setup. We construct algorithms to minimize action redundancy and demonstrate their effectiveness on a synthetic environment with multiple redundant actions as well as on contemporary benchmarks in Atari and MuJoCo. Our results suggest that action redundancy is a fundamental problem in reinforcement learning.
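
The gap between action entropy and next-state entropy can be seen in a small numerical sketch. The example below is an illustration only, not the authors' algorithm: the single-state environment, the `transition` matrix, and the two policies are hypothetical values chosen so that three of four actions lead to the same next state.

```python
import numpy as np


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    p = np.asarray(p, dtype=float)
    p = p[p > 0]  # treat 0 * log(0) as 0
    return float(-(p * np.log(p)).sum())


# Hypothetical single-state example: 4 actions, 2 possible next states.
# transition[a, s_next] = P(s_next | s, a).
# Actions 0, 1, 2 are redundant: all of them lead to next state 0.
transition = np.array([
    [1.0, 0.0],
    [1.0, 0.0],
    [1.0, 0.0],
    [0.0, 1.0],
])


def next_state_dist(policy, transition):
    """Marginal next-state distribution: sum_a pi(a|s) * P(s_next | s, a)."""
    return np.asarray(policy, dtype=float) @ transition


# Uniform over actions: maximal action entropy, skewed next-state distribution.
uniform_policy = np.full(4, 0.25)

# Hypothetical policy that splits mass evenly over *distinct outcomes*.
balanced_policy = np.array([1 / 6, 1 / 6, 1 / 6, 1 / 2])

action_entropy_uniform = entropy(uniform_policy)
transition_entropy_uniform = entropy(next_state_dist(uniform_policy, transition))

action_entropy_balanced = entropy(balanced_policy)
transition_entropy_balanced = entropy(next_state_dist(balanced_policy, transition))

print("uniform :", action_entropy_uniform, transition_entropy_uniform)
print("balanced:", action_entropy_balanced, transition_entropy_balanced)
```

In this toy setting the uniform policy has the highest possible action entropy yet visits next state 0 three times as often as next state 1, while the balanced policy has lower action entropy but a uniform next-state distribution. This is the kind of mismatch that redundant actions create.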
