Abstract

The use of neural networks and reinforcement learning has become increasingly popular in autonomous vehicle control. However, the opaqueness of the resulting control policies presents a significant barrier to deploying neural network-based control in autonomous vehicles. In this paper, we present a reinforcement learning-based approach to autonomous vehicle longitudinal control, in which rule-based safety cages provide enhanced safety for the vehicle as well as weak supervision to the reinforcement learning agent. By guiding the agent towards meaningful states and actions, this weak supervision improves convergence during training and enhances the safety of the final trained policy. The rule-based supervisory controller has the further advantage of being fully interpretable, thereby enabling traditional validation and verification approaches to ensure the safety of the vehicle. We compare models with and without safety cages, as well as models with optimal and constrained model parameters, and show that the weak supervision consistently improves the safety of exploration, speed of convergence, and model performance. Additionally, we show that when the model parameters are constrained or sub-optimal, the safety cages can enable a model to learn a safe driving policy even when the model could not be trained to drive through reinforcement learning alone.
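
To make the supervisory architecture concrete, below is a minimal sketch of a rule-based safety cage wrapping an RL agent's longitudinal command. The state fields, headway and time-to-collision thresholds, and the action convention (pedal command in [-1, 1]) are illustrative assumptions, not the paper's exact implementation.

```python
# Hypothetical sketch of a rule-based safety cage for longitudinal control.
# Thresholds and state representation are assumptions for illustration.
from dataclasses import dataclass

@dataclass
class State:
    ego_speed: float      # ego vehicle speed, m/s
    gap: float            # distance to lead vehicle, m
    closing_speed: float  # ego_speed - lead_speed, m/s

def safety_cage(state: State, rl_action: float,
                min_headway_s: float = 1.0, min_ttc_s: float = 2.0):
    """Return (action, intervened): override the learned pedal command with
    hard braking when time headway or time-to-collision drops below a rule-based limit."""
    headway = state.gap / max(state.ego_speed, 0.1)
    ttc = state.gap / state.closing_speed if state.closing_speed > 0 else float("inf")
    if headway < min_headway_s or ttc < min_ttc_s:
        return -1.0, True      # safety cage intervenes with full braking
    return rl_action, False    # learned action passes through unchanged
```

During training, the `intervened` flag can also serve as the weak supervision signal, for example by adding a penalty to the reward whenever the cage overrides the agent; the exact reward shaping used in the paper is not reproduced here.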

Highlights

  • We demonstrate that, with weak supervision from the safety cages during training, a shallow model that otherwise could not learn to drive can be trained to drive without collisions

  • We demonstrate that the interventions by the safety cages can be used to re-train the neural network through supervised learning, enabling the system to learn from its own mistakes and making the controller more robust (a minimal sketch of this re-training step follows this list)
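
The sketch below illustrates this re-training step, assuming a PyTorch policy network and a buffer of (state, safe_action) pairs logged at each intervention; the network size, optimiser settings, and buffer format are illustrative assumptions rather than the paper's configuration.

```python
# Hypothetical supervised fine-tuning of the policy on safety cage interventions.
import torch
import torch.nn as nn

# Small actor network: 3-dimensional state in, pedal command in [-1, 1] out (assumed shapes).
actor = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 1), nn.Tanh())
optimiser = torch.optim.Adam(actor.parameters(), lr=1e-4)
loss_fn = nn.MSELoss()

def retrain_on_interventions(intervention_buffer, epochs: int = 5):
    """Regress the policy towards the corrective actions issued by the safety cage."""
    states = torch.tensor([s for s, _ in intervention_buffer], dtype=torch.float32)
    safe_actions = torch.tensor([[a] for _, a in intervention_buffer], dtype=torch.float32)
    for _ in range(epochs):
        optimiser.zero_grad()
        loss = loss_fn(actor(states), safe_actions)
        loss.backward()
        optimiser.step()
```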

Summary

Introduction

Autonomous driving has gained significant attention within the automotive research community in recent years [1,2,3]. While imitation learning-based approaches have shown important progress in autonomous driving [27,28,29,30], they present limitations when deployed in environments beyond the training distribution [31]. Driving models relying on supervised techniques are often evaluated with performance metrics on pre-collected validation datasets [32], but low prediction error in offline testing is not necessarily correlated with driving quality [33]. We focus on longitudinal control and extend our previous work on RL-based longitudinal control in a highway driving environment [20].

Safety Cages
Reinforcement Learning
Deep Deterministic Policy Gradient
Highway Vehicle Following Use-Case
Training
Results
Naturalistic Testing
Adversarial Testing
Conclusions
