Reinforcement Learning with Reward Shaping and Hybrid Exploration in Sparse Reward Scenes

Yulong Yang, Weihua Cao, Chao Gan, Min Wu, Linwei Guo

doi:10.1109/icps58381.2023.10128012

Abstract

High-precision modeling of industrial systems is difficult and costly. Model-free intelligent control methods, represented by reinforcement learning, have been broadly applied in industrial systems. The difficulty of evaluating production states and the low value density of processing data cause sparse rewards, which lead to insufficient performance of reinforcement learning. To overcome the difficulty of reinforcement learning in sparse reward scenes, a reinforcement learning method with reward shaping and hybrid exploration is proposed. By improving the reward distribution over the state space of the environment, reward shaping makes the state-value estimation of reinforcement learning more accurate. By improving the reward distribution over the time dimension, hybrid exploration makes the iteration of reinforcement learning more efficient and more stable. Finally, the effectiveness of the proposed method is verified by simulations.
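
The abstract does not specify the shaping function the authors use. As a generic illustration of how reward shaping can densify a sparse reward, the following minimal sketch uses the standard potential-based form, r + gamma * phi(s') - phi(s). The grid world, the goal cell, the discount factor, and the Manhattan-distance potential are all hypothetical choices made for this example and are not taken from the paper.

```python
GAMMA = 0.99   # discount factor (assumed value, not from the paper)
GOAL = (4, 4)  # hypothetical goal cell in a small grid world


def potential(state):
    """Hypothetical potential phi(s): negative Manhattan distance to the goal."""
    return -(abs(state[0] - GOAL[0]) + abs(state[1] - GOAL[1]))


def sparse_reward(next_state):
    """Original sparse reward: 1 only when the goal is reached, otherwise 0."""
    return 1.0 if next_state == GOAL else 0.0


def shaped_reward(state, next_state, gamma=GAMMA):
    """Sparse reward plus the potential-based term gamma * phi(s') - phi(s)."""
    return sparse_reward(next_state) + gamma * potential(next_state) - potential(state)
```

With this potential, a step toward the goal receives a positive shaped reward and a step away receives a negative one, so the agent gets a learning signal in states where the original reward is zero.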

