Abstract

Reward policy is a crucial part of Deep Reinforcement Learning (DRL) applications in robotics. The challenge of building autonomous systems with "human-like" behavior has created a significant need for better, faster, and more robust training based on an optimized reward function. Inspired by work from Berkeley and Google, this paper presents our recent development in reward policy/function design. In particular, we have formulated an accelerated reward policy (ARP) based on a non-linear function. We applied this reward function to the SAC (Soft Actor-Critic) algorithm for training a 6 DoF (Degree of Freedom) robot in a simulated environment built on the Unity gaming platform. This nonlinear ARP function gives a larger reward to accelerate the robot's positive behavior during training. Compared to the existing algorithm, our experimental results demonstrate faster convergence and a larger accumulated reward. With limited experimental data, the results show an improvement in accumulated reward of up to two times the previous results.

Keywords: Deep Reinforcement Learning, Machine learning, Autonomous systems, 6 DoF robot, Unity
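The abstract does not state the exact form of the ARP function, but the core idea of a nonlinear shaping that rewards positive progress more steeply can be sketched. The following is a minimal, hypothetical illustration: the function name `arp_reward`, the exponential form, and the `scale`/`sharpness` parameters are our assumptions, not the authors' published formula.

```python
import numpy as np

def arp_reward(ee_pos, goal_pos, scale=2.0, sharpness=5.0):
    """Hypothetical nonlinear reward in the spirit of ARP.

    Returns a reward in (0, scale] that grows super-linearly as the
    end-effector approaches the goal, amplifying positive behavior
    relative to a plain linear distance reward.

    ee_pos, goal_pos : 3-vectors (end-effector and target positions)
    scale            : maximum reward at zero distance (assumed parameter)
    sharpness        : how aggressively reward grows near the goal (assumed)
    """
    d = np.linalg.norm(np.asarray(ee_pos) - np.asarray(goal_pos))
    # Exponential shaping: halving the distance more than doubles the
    # reward increment, unlike a linear -d penalty.
    return scale * np.exp(-sharpness * d)

# Example: the reward rises sharply near the goal and stays small far away.
print(arp_reward([0.1, 0.0, 0.0], [0.0, 0.0, 0.0]))  # near goal: large reward
print(arp_reward([0.5, 0.0, 0.0], [0.0, 0.0, 0.0]))  # far away: small reward
```

Such a shaped reward would be returned at each environment step (e.g., inside a Unity ML-Agents reacher task) and fed to an off-the-shelf SAC learner; any nonlinear monotone shaping with a steep gradient near the goal would serve the same illustrative purpose.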

