Survival-Oriented Reinforcement Learning Model: An Effcient and Robust Deep Reinforcement Learning Algorithm for Autonomous Driving Problem

Changkun Ye,Huimin Ma,Kai Zhang,Shaodi You,Xiaoqin Zhang

doi:10.1007/978-3-319-71589-6_36

Survival-Oriented Reinforcement Learning Model: An Effcient and Robust Deep Reinforcement Learning Algorithm for Autonomous Driving Problem

Changkun Ye, Huimin Ma + Show 3 more

https://doi.org/10.1007/978-3-319-71589-6_36

Copy DOI

Publication Date: Jan 1, 2017

Citations: 9

Affiliation: Tsinghua University, Australian National University, Data61, Commonwealth Scientific and Industrial Research Organisation

#Deep Reinforcement Learning Algorithm #Constrained Markov Decision Process + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Using Deep Reinforcement Learning (DRL) algorithm to deal with autonomous driving tasks usually have unsatisfied performance due to lack of robustness and means to escape local optimum. In this article, we designs a Survival-Oriented Reinforcement Learning (SORL) model that tackle these problems by setting survival rather than maximize total reward as first priority. In SORL model, we model autonomous driving task as Constrained Markov Decision Process (CMDP) and introduce Negative-Avoidance Function to learn from previous failure. The SORL model greatly speed up the training process and improve the robustness of normal Deep Reinforcement Learning algorithm.

Full Text