Abstract

Energy and thermal management is a crucial element of Formula-E race strategy development. In this study, race-level strategy development is formulated as a Markov decision process (MDP) with a hybrid action space. Deep Deterministic Policy Gradient (DDPG) reinforcement learning is implemented under the distributed Ape-X architecture and combined with prioritized experience replay and reward shaping to optimize a hybrid action set comprising both continuous and discrete components. Soft boundary-violation penalties in the shaped reward significantly improve the performance of DDPG and enable it to generate faster race-finishing solutions. The proposed method outperforms Monte Carlo Tree Search (MCTS) with policy-gradient reinforcement learning, which solves this problem in a fully discrete action space as reported in the literature, achieving faster race-finishing times and better handling of ambient temperature rise.
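The abstract does not include code, but the two core ideas it names, soft boundary-violation penalties and a hybrid continuous/discrete action output from DDPG, can be illustrated with a minimal Python sketch. All names, constraint choices (battery temperature and energy budget), thresholds, and scales below are hypothetical assumptions for illustration, not taken from the paper.

```python
def shaped_reward(base_reward: float,
                  battery_temp: float, temp_limit: float,
                  energy_used: float, energy_budget: float,
                  penalty_scale: float = 10.0) -> float:
    """Reward with soft boundary-violation penalties (illustrative only).

    Rather than terminating the episode or applying a fixed large
    penalty when a constraint is crossed, the penalty grows with the
    magnitude of the violation, giving the critic a smooth gradient
    back toward the feasible region.
    """
    # Overshoot of the (assumed) battery temperature limit.
    temp_violation = max(0.0, battery_temp - temp_limit)
    # Overshoot of the (assumed) race energy budget.
    energy_violation = max(0.0, energy_used - energy_budget)
    return base_reward - penalty_scale * (temp_violation + energy_violation)


def to_hybrid_action(actor_output):
    """Map DDPG's continuous actor output to a hybrid action (sketch).

    One common workaround for hybrid action spaces is to keep some
    actor outputs continuous (e.g. a power level) and threshold others
    into discrete decisions (e.g. an on/off mode switch).
    """
    continuous_part = float(actor_output[0])           # e.g. power setting
    discrete_part = int(actor_output[1] > 0.0)         # e.g. mode on/off
    return continuous_part, discrete_part
```

A hard constraint would replace the proportional penalty terms with a fixed constant or episode termination; the soft, violation-proportional form is what the abstract credits with improving DDPG's performance.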
