Reinforcement Learning based cooperative longitudinal control for reducing traffic oscillations and improving platoon stability

Liming Jiang,Yuanchang Xie,Nicholas G Evans,Xiao Wen,Tienan Li,Danjue Chen

doi:10.1016/j.trc.2022.103744


Open Access

https://doi.org/10.1016/j.trc.2022.103744


Abstract

Stop-and-go traffic poses significant challenges to the efficiency and safety of traffic operations. In this study, a cooperative longitudinal control method based on Soft Actor-Critic (SAC) Reinforcement Learning (RL) is proposed to address this issue. The reward function is designed to account for vehicle cooperation and to achieve three main objectives: safety, efficiency, and oscillation dampening. A global performance metric for oscillation dampening is proposed to evaluate the developed RL models and the baseline models. Depending on the number of preceding vehicles that can share maneuver information, two models, RL-1 and RL-2, are proposed and compared with human-driven (HD) vehicles and an adaptive cruise control (ACC) model using the HighD dataset and simulated data. With information from additional preceding vehicles, RL-2 dampens shockwaves more efficiently. Specifically, RL-1 and RL-2 decrease traffic oscillation by 15%–36% and 15%–42%, respectively, while HD amplifies the oscillation by 14%–37%. The ACC model can also dampen shockwaves but is not as effective as RL-1 and RL-2. The two RL control methods are further evaluated using data collected with a commercial Model X vehicle. Compared with the commercial Model X ACC in some controlled settings, the proposed RL methods better dampen stop-and-go waves, producing smaller oscillation growth, less overshooting, and a smaller average change in acceleration/deceleration rate, which suggests that they can generalize well to a new but similar environment. Finally, the RL methods are evaluated in a platoon of vehicles with different RL penetration rates. The results show that they consistently outperform HD and ACC in dampening shockwaves.
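
The abstract reports oscillation changes as percentages (dampening for RL-1, RL-2, and ACC; amplification for HD) but does not define the underlying metric. As a rough illustration only, the sketch below assumes a simple stand-in: the relative change in the standard deviation of speed between a leader and a follower. The function name `oscillation_change` and the speed traces are hypothetical and are not taken from the paper.

```python
import statistics


def oscillation_change(leader_speeds, follower_speeds):
    """Relative change in speed variability from leader to follower.

    Negative values mean the follower dampens the oscillation;
    positive values mean it amplifies it.
    NOTE: an assumed, illustrative measure, not the paper's metric.
    """
    leader_sd = statistics.pstdev(leader_speeds)
    follower_sd = statistics.pstdev(follower_speeds)
    if leader_sd == 0:
        raise ValueError("leader speed is constant; change is undefined")
    return (follower_sd - leader_sd) / leader_sd


# Made-up speed traces in m/s for one stop-and-go cycle.
leader = [20.0, 16.0, 12.0, 16.0, 20.0]
damped = [20.0, 17.0, 14.0, 17.0, 20.0]      # follower slows less than leader
amplified = [20.0, 15.0, 10.0, 15.0, 20.0]   # follower slows more than leader

print(f"damped follower:    {oscillation_change(leader, damped):+.0%}")
print(f"amplified follower: {oscillation_change(leader, amplified):+.0%}")
```

With these made-up traces the damped follower comes out at about -25% and the amplified one at about +25%, which mirrors the sign convention in the abstract: reductions for the controllers that dampen shockwaves, increases for behavior that amplifies them.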

Journal: Transportation Research Part C: Emerging Technologies
Publication Date: Jun 9, 2022
Citations: 22
License type: publisher-specific-oa

Similar Papers

Safety impact of cooperative adaptive cruise control vehicles’ degradation under spatial continuous communication interruption
Weijie Yu ... De Zhao
IET Intelligent Transport Systems | VOL. 16
28 Nov 2021

Application of gas-kinetic theory to modelling mixed traffic of manual and ACC vehicles
D Ngoduy
Transportmetrica | VOL. 8
01 Jan 2012

A hybrid traffic flow model with considering the influence of adaptive cruise control vehicles and on-ramps
Hua Xue-Dong ... Wang Hao
Acta Physica Sinica | VOL. 65
01 Jan 2015

Car-following behavior characteristics of adaptive cruise control vehicles based on empirical experiments
Tienan Li ... Yuanchang Xie
Transportation Research Part B: Methodological | VOL. 147
30 Mar 2021
