Abstract

Environments with sparse rewards are a common problem in reinforcement learning, and agents learn inefficiently in them with standard methods. A new method called trial-and-error experience replay is proposed, in which standard hindsight experience replay is combined with a curiosity-driven model so that sample efficiency improves even though extrinsic rewards are sparse. The method is demonstrated as an algorithm that controls a virtual robotic arm to reach a moving goal. Analysis shows that the robotic arm can explore and learn from failed trajectories, so the agent mimics a human who fails repeatedly but still tries to learn something from the unexpected outcomes.
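To illustrate the idea, the following is a minimal sketch of how hindsight goal relabeling might be combined with a curiosity bonus, as the abstract describes. It is not the paper's implementation: the function names, the "final-state" relabeling strategy, the distance-threshold reward, and the toy linear forward model are all illustrative assumptions.

```python
import numpy as np

def curiosity_bonus(forward_model, state, action, next_state):
    """Intrinsic reward: prediction error of a (learned) forward model."""
    predicted = forward_model(state, action)
    return float(np.sum((predicted - next_state) ** 2))

def relabel_and_store(buffer, episode, forward_model, eta=0.01, eps=0.05):
    """Store original transitions plus hindsight-relabeled copies.

    episode: list of (state, action, next_state, goal) tuples.
    eta: weight of the curiosity bonus added to the reward (assumed).
    eps: tolerance for declaring a goal reached (assumed).
    """
    achieved = episode[-1][2]  # final achieved state becomes the new goal
    for state, action, next_state, goal in episode:
        intrinsic = eta * curiosity_bonus(forward_model, state, action, next_state)
        # Sparse extrinsic reward against the original goal.
        r = 0.0 if np.linalg.norm(next_state - goal) < eps else -1.0
        buffer.append((state, action, next_state, goal, r + intrinsic))
        # Hindsight copy: pretend the achieved outcome was the goal,
        # so even a failed trajectory yields a useful success signal.
        r_h = 0.0 if np.linalg.norm(next_state - achieved) < eps else -1.0
        buffer.append((state, action, next_state, achieved, r_h + intrinsic))

# Toy usage: a linear predictor stands in for a learned forward model.
rng = np.random.default_rng(0)
forward = lambda s, a: s + 0.1 * a
episode = [(rng.normal(size=3), rng.normal(size=3),
            rng.normal(size=3), np.zeros(3)) for _ in range(5)]
buffer = []
relabel_and_store(buffer, episode, forward)
print(len(buffer))  # 10 transitions: 5 original + 5 hindsight-relabeled
```

The relabeled copies are what let the agent learn from failures: every trajectory reaches *some* state, so treating that state as the goal in hindsight converts a failed episode into a successful one for training purposes, while the curiosity term keeps exploration going when both reward signals are flat.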
