Abstract

Reinforcement learning (RL) can learn from past failures and has the potential to provide self-improvement and higher-level intelligence. However, current RL algorithms still suffer from reliability challenges, especially compared with rule/model-based algorithms, which are pre-engineered and human-input intensive but widely used in autonomous vehicles. To take advantage of both RL and rule-based algorithms, this work designs a decision-making framework that leverages RL while using an existing rule-based policy as its performance lower bound. In this way, the final policy retains the potential for self-learning while guaranteeing better system performance than the integrated rule-based policy. This decision-making framework is called trustworthy improvement RL (TiRL). The basic idea is to have the RL policy iteration process synchronously estimate the value function of the given rule-based policy; the autonomous vehicle then drives with the RL policy only in cases where RL has learned a better policy, i.e., one with a higher policy value. This work takes safe highway driving as the case study. The results are obtained from more than 42,000 km of driving in stochastic simulated traffic calibrated by naturalistic driving data. The TiRL planner is evaluated with two typical rule-based highway-driving policies for comparison. The results show that TiRL can outperform the arbitrarily given rule-based driving policy. In summary, the proposed TiRL leverages the learning-based method in stochastic and emergent scenarios while providing a trustworthy safety improvement over the existing rule-based policies.
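
The switching idea described in the abstract can be sketched as follows. This is a minimal illustrative sketch, not the paper's exact algorithm: it assumes that value estimates for both the learned RL policy and the given rule-based policy are maintained during policy iteration, and that the selection reduces to a state-wise value comparison. The class name, callable interfaces, and the optional confidence margin are hypothetical.

```python
# Illustrative sketch of the TiRL switching idea (assumed interfaces, not the
# paper's implementation): value estimates are kept for both the learned RL
# policy and the given rule-based policy; at run time the RL action is used
# only where the RL policy's estimated value is higher.

class TiRLSelector:
    def __init__(self, rl_policy, rule_policy, v_rl, v_rule, margin=0.0):
        self.rl_policy = rl_policy      # learned policy: state -> action
        self.rule_policy = rule_policy  # existing rule-based policy: state -> action
        self.v_rl = v_rl                # value estimate of the RL policy: state -> float
        self.v_rule = v_rule            # value estimate of the rule-based policy: state -> float
        self.margin = margin            # optional margin before trusting the RL policy (assumption)

    def act(self, state):
        # Use the RL policy only where it is estimated to be better;
        # otherwise fall back to the rule-based policy, which serves as
        # the performance lower bound.
        if self.v_rl(state) > self.v_rule(state) + self.margin:
            return self.rl_policy(state)
        return self.rule_policy(state)
```

Falling back to the rule-based policy whenever the RL value estimate is not higher is what makes the existing policy act as the performance lower bound in this sketch.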
