Reward Space Noise for Exploration in Deep Reinforcement Learning

Chuxiong Sun, Qian Li, Rui Wang, Xiaohui Hu

doi:10.1142/s0218001421520133

Abstract

A fundamental challenge in reinforcement learning (RL) is how to explore efficiently in initially unknown environments. Most state-of-the-art RL algorithms rely on action space noise to drive exploration. These classical strategies are computationally efficient and straightforward to implement, but they may fail to perform effectively in complex environments. To address this issue, we propose a novel strategy named reward space noise (RSN) for farsighted and consistent exploration in RL. By introducing stochasticity in the reward space, we change the agent's understanding of the environment and thereby perturb its behavior. We find that the simple RSN achieves consistent exploration and scales to complex domains without intensive computational cost. To demonstrate the effectiveness and scalability of the proposed method, we implement a deep Q-learning agent with reward noise and evaluate its exploratory performance on a set of Atari games that are challenging for the naive ε-greedy strategy. The results show that reward noise outperforms action noise in most games and performs comparably in the others. Concretely, in early training the best exploratory performance of reward noise is clearly better than that of action noise, which demonstrates that reward noise can quickly explore valuable states and aid in finding the optimal policy. Moreover, the average scores and learning efficiency of reward noise are also higher than those of action noise throughout training, which indicates that reward noise yields more stable and consistent performance.
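The general idea of perturbing rewards rather than actions can be illustrated with a minimal tabular sketch. This is not the authors' implementation: the abstract describes a deep Q-learning agent and does not specify the noise distribution, so the zero-mean Gaussian noise, the scale `sigma`, the tabular Q-table, and the function names below are all assumptions made for illustration only.

```python
import numpy as np


def q_update_with_reward_noise(q, s, a, r, s_next, done, rng,
                               alpha=0.1, gamma=0.99, sigma=0.5):
    """One tabular Q-learning step with noise added to the reward.

    Assumption: zero-mean Gaussian noise with standard deviation `sigma`.
    The abstract does not state which noise distribution is used.
    """
    noisy_r = r + rng.normal(0.0, sigma)  # perturb the reward, not the action
    # Standard Q-learning target, computed from the perturbed reward.
    target = noisy_r if done else noisy_r + gamma * np.max(q[s_next])
    q[s, a] += alpha * (target - q[s, a])
    return q


def greedy_action(q, s):
    """Act greedily; any exploration comes only from the perturbed value estimates."""
    return int(np.argmax(q[s]))
```

In this sketch the agent always takes the greedy action, so any variation in its behavior arises from how the noisy rewards shift the learned Q-values, in contrast to ε-greedy, which injects randomness directly at action selection.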

Journal: International Journal of Pattern Recognition and Artificial Intelligence
Publication Date: May 21, 2021
Citations: 1

Similar Papers

Towards High-Level Intrinsic Exploration in Reinforcement Learning
Nicolas Bougie ... Ryutaro Ichise
01 Jul 2020

Exploiting Action-Value Uncertainty to Drive Exploration in Reinforcement Learning
Carlo D'Eramo ... Andrea Cini
01 Jul 2019

Strategic Exploration in Reinforcement Learning - New Algorithms and Learning Guarantees
24 Feb 2020

Improving exploration efficiency of deep reinforcement learning through samples produced by generative model
Dayong Xu ... Peiyao Zhao
Expert Systems With Applications | VOL. 185
30 Jul 2021
