Abstract

Neural networks (NN) are gaining importance in sequential decision-making. Deep reinforcement learning (DRL), in particular, has been extremely successful in learning action policies in complex and dynamic environments. Despite this success, however, DRL technology is not without its failures, especially in safety-critical applications: (i) the training objective maximizes average rewards, which may disregard rare but critical situations and hence lack local robustness; (ii) optimization objectives targeting safety typically yield degenerate reward structures, which, for DRL to work, must be replaced with proxy objectives. Here, we introduce a methodology that can help to address both deficiencies. We incorporate evaluation stages (ES) into DRL, leveraging recent work on deep statistical model checking (DSMC), which verifies NN policies in Markov decision processes. Our ES apply DSMC at regular intervals to determine state space regions with weak performance. We adapt the subsequent DRL training priorities based on the outcome, (i) focusing DRL on critical situations and (ii) making it possible to foster arbitrary objectives. We run case studies on two benchmarks. One of them is Racetrack, an abstraction of autonomous driving that requires navigating a map without crashing into a wall. The other is MiniGrid, a widely used benchmark in the AI community. Our results show that DSMC-based ES can significantly improve both (i) and (ii).
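To make the described loop concrete, the following Python sketch shows one way evaluation stages could be interleaved with DRL training. It is a minimal illustration under stated assumptions, not the paper's implementation: the DSMC check is approximated here by plain Monte Carlo rollouts against a gym-style step API, and the names train_drl, env_factory, goal_reached, and the 0.05 weight floor are illustrative placeholders.

    def dsmc_estimate(policy, env_factory, state, n_runs=200, horizon=500):
        # Statistical model checking by simulation: estimate the probability
        # that the NN policy reaches the goal when started in `state`.
        successes = 0
        for _ in range(n_runs):
            env = env_factory(state)   # environment initialized in `state`
            obs = env.reset()
            done, info, steps = False, {}, 0
            while not done and steps < horizon:
                obs, _reward, done, info = env.step(policy(obs))
                steps += 1
            successes += int(info.get("goal_reached", False))
        return successes / n_runs

    def train_with_evaluation_stages(policy, train_drl, env_factory,
                                     initial_states, n_stages=10,
                                     steps_per_stage=50_000):
        # Begin with a uniformly weighted pool of initial states.
        weights = {s: 1.0 for s in initial_states}
        for _ in range(n_stages):
            # Ordinary DRL training, sampling start states by current priority.
            train_drl(policy, env_factory, weights, steps=steps_per_stage)
            # Evaluation stage: check the current policy state by state.
            estimates = {s: dsmc_estimate(policy, env_factory, s)
                         for s in initial_states}
            # Re-prioritize: regions with weak performance are sampled more
            # often in the next stage; the floor keeps every state in play.
            weights = {s: max(1.0 - p, 0.05) for s, p in estimates.items()}
        return policy

The key design choice this sketch highlights is that the evaluation stage only changes the initial-state distribution between training stages; the DRL algorithm itself is left untouched, which is what allows arbitrary objectives to be fostered through the priorities.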
