Automating Vehicles by Risk-Averse Preview-based Q-Learning Algorithm

Majid Mazouchi, Subramanya Nageshrao, Hamidreza Modares

doi:10.1016/j.ifacol.2022.07.616

Abstract

A risk-averse preview-based Q-learning planner is presented for the navigation of autonomous vehicles. To this end, the multi-lane road ahead of a vehicle is represented by a finite-state non-stationary Markov decision process (MDP). A sampling-based risk-averse preview-based Q-learning algorithm is then developed that generates samples from the preview information and the reward function, so that risk-averse optimal planning strategies are learned without actual interaction with the environment. A risk factor is imposed on the objective function to avoid fluctuation of the Q values, which can jeopardize the vehicle's safety and/or performance. Theoretical results are provided that bound the number of samples required to guarantee ϵ-optimal planning with high probability. Finally, to verify the efficiency of the presented algorithm, it is applied to highway driving of an autonomous vehicle under varying traffic density.
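The idea of learning from previewed samples rather than from real interaction can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the risk measure, so the mean-minus-standard-deviation penalty below is an assumption, as are the function name `risk_averse_preview_q`, the parameters `n_samples` and `risk_weight`, and the toy two-stage MDP.

```python
import numpy as np

def risk_averse_preview_q(P, R, n_samples=2000, risk_weight=0.5, gamma=1.0, seed=0):
    """Sample-based backward Q estimation on a finite-horizon (non-stationary) MDP.

    P: array (T, S, A, S), previewed transition probabilities per stage.
    R: array (T, S, A), previewed rewards per stage.
    The risk penalty (mean minus risk_weight * std of sampled targets) is an
    illustrative assumption, not the risk measure used in the paper.
    """
    rng = np.random.default_rng(seed)
    T, S, A, _ = P.shape
    Q = np.zeros((T + 1, S, A))  # Q[T] stays zero (terminal stage)
    for t in range(T - 1, -1, -1):
        V_next = Q[t + 1].max(axis=1)  # greedy value at the next stage
        for s in range(S):
            for a in range(A):
                # Draw next states from the previewed model, not the real road.
                nxt = rng.choice(S, size=n_samples, p=P[t, s, a])
                targets = R[t, s, a] + gamma * V_next[nxt]
                # Penalize spread of the sampled targets (assumed risk term).
                Q[t, s, a] = targets.mean() - risk_weight * targets.std()
    return Q

# Hypothetical toy preview: 2 stages, 2 states, 2 actions.
P = np.zeros((2, 2, 2, 2))
P[..., 0] = 1.0                # by default every move leads to state 0
P[0, 0, 1] = [0.5, 0.5]        # stage 0, state 0, action 1 is a 50/50 gamble
R = np.zeros((2, 2, 2))
R[0, 0, 0] = 1.5               # certain reward for the safe action
R[1, 1, :] = 4.0               # payoff only if the gamble reaches state 1

Q_averse = risk_averse_preview_q(P, R, risk_weight=0.5)
Q_neutral = risk_averse_preview_q(P, R, risk_weight=0.0)
print("risk-averse greedy action:", Q_averse[0, 0].argmax())
print("risk-neutral greedy action:", Q_neutral[0, 0].argmax())
```

In this toy instance the gamble has the higher expected return but a large spread, so the penalized estimate should prefer the certain action while the unpenalized one should prefer the gamble. More samples tighten both estimates, which is the kind of dependence the sample bound in the abstract quantifies.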

Journal: IFAC PapersOnLine
Publication Date: Jan 1, 2022
Citations: 1

Similar Papers

A Risk-Averse Preview-Based Q-Learning Algorithm: Application to Highway Driving of Autonomous Vehicles
Majid Mazouchi ... Hamidreza Modares
IEEE Transactions on Control Systems Technology | VOL. 31
01 Jul 2023

On the Optimality of Structured Policies in Countable Stage Decision Processes
Evan L Porteus
Management Science | VOL. 22
01 Oct 1975

A Strongly Polynomial Algorithm for Controlled Queues
Alexander Zadorojniy ... Guy Even
Mathematics of Operations Research | VOL. 34
01 Nov 2009

6 Algorithms and complexity for Markov processes
Ravi Varadarajan
Handbook of Statistics | VOL. 9
01 Jan 1992
