Implications of stop-and-go traffic on training learning-based car-following control

Anye Zhou, Srinivas Peeta, Hao Zhou, Jorge Laval, Zejiang Wang, Adian Cook

doi:10.1016/j.trc.2024.104578

Abstract

Learning-based car-following control (LCC) of connected and autonomous vehicles (CAVs) is gaining significant attention with the advancement of computing power and data accessibility. While the flexibility and large model capacity of model-free architectures enable LCC to potentially outperform model-based car-following (CF) models in improving traffic efficiency and mitigating congestion, the generalizability of LCC to traffic conditions different from the training environment/dataset is not well understood. This study explores the impact of stop-and-go traffic in the training dataset on the generalizability of LCC. It uses the characteristics of lead vehicle trajectories to describe stop-and-go traffic, and links the theory of identifiability (i.e., obtaining a unique parameter estimation result using sensor measurements) to the generalizability of behavior cloning (BC) and policy-based deep reinforcement learning (DRL). Correspondingly, the study shows theoretically that: (i) stop-and-go traffic can enable the property of identifiability and enhance the control performance of BC-based LCC in different traffic conditions; (ii) stop-and-go traffic is not necessary for DRL-based LCC to generalize to different traffic conditions; (iii) DRL-based LCC trained with only constant-speed lead vehicle trajectories (not sufficient to ensure identifiability) can be generalized to different traffic conditions; and (iv) stop-and-go traffic increases variance in the training dataset, which improves the convergence of parameter estimation while negatively impacting the convergence of DRL to the optimal control policy. Numerical experiments validate the above findings, illustrating that BC-based LCC requires comprehensive training datasets to generalize to different traffic conditions, while DRL-based LCC can achieve generalization with simple free-flow traffic training environments. This further suggests DRL as a more promising and cost-effective LCC approach to reduce operational costs, mitigate traffic congestion, and enhance safety and mobility, which can accelerate the deployment and acceptance of CAVs.
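The link between lead-vehicle trajectories and identifiability can be illustrated with a small sketch. It is not the paper's model or experiment: the linear expert controller, its gains (`kp`, `kd`, `tau`, `d0`), the initial state, and both lead-speed profiles are hypothetical choices made only for illustration. The sketch fits the expert's parameters by least squares, a simple stand-in for behavior cloning, and compares the rank of the feature matrix under a constant-speed lead and an oscillating (stop-and-go-like) lead.

```python
import numpy as np

def simulate(lead_speed, kp=0.5, kd=0.8, tau=1.5, d0=5.0,
             dt=0.1, gap0=30.0, v0=12.0):
    """Roll out a hypothetical linear expert follower (Euler steps).

    Returns the feature matrix X = [gap, v, v_lead - v, 1] and the
    expert accelerations a, which are exactly linear in X.
    """
    gap, v = gap0, v0
    rows, acc = [], []
    for vl in lead_speed:
        a = kp * (gap - d0 - tau * v) + kd * (vl - v)
        rows.append([gap, v, vl - v, 1.0])
        acc.append(a)
        gap += (vl - v) * dt   # gap grows when the leader is faster
        v += a * dt
    return np.array(rows), np.array(acc)

t = np.arange(0.0, 120.0, 0.1)
const_lead = np.full_like(t, 15.0)              # constant-speed leader
stop_go_lead = 10.0 + 10.0 * np.sin(0.3 * t)    # assumed oscillation

X_c, a_c = simulate(const_lead)
X_s, a_s = simulate(stop_go_lead)

# Rank of the regressors: full rank (4) is needed for a unique fit.
rank_c = np.linalg.matrix_rank(X_c)
rank_s = np.linalg.matrix_rank(X_s)

# Least-squares "behavior cloning" of the expert's linear parameters.
theta_true = np.array([0.5, -0.5 * 1.5, 0.8, -0.5 * 5.0])
theta_s, *_ = np.linalg.lstsq(X_s, a_s, rcond=None)

print("rank (constant lead):", rank_c)
print("rank (oscillating lead):", rank_s)
print("max parameter error (oscillating lead):",
      np.abs(theta_s - theta_true).max())
```

With a constant lead speed, the speed-difference feature equals a constant minus the follower speed, so the feature columns are linearly dependent and the least-squares fit is not unique, even though the follower goes through a transient. The oscillating lead breaks that dependence, and in this noise-free setting the fit recovers the assumed parameters.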

Similar Papers

Managing mixed traffic at signalized intersections: An adaptive signal control and CAV coordination system based on deep reinforcement learning
Duowei Li ... Tianyi Chen
Expert Systems with Applications | VOL. 238
07 Oct 2023

COOR-PLT: A hierarchical control model for coordinating adaptive platoons of connected and autonomous vehicles at signal-free intersections based on deep reinforcement learning
Duowei Li ... Jianping Wu
Transportation Research Part C: Emerging Technologies | VOL. 146
29 Nov 2022

Modeling adaptive platoon and reservation‐based intersection control for connected and autonomous vehicles employing deep reinforcement learning
Duowei Li ... Tianyi Chen
Computer-Aided Civil and Infrastructure Engineering | VOL. 38
12 Dec 2022

Proactive longitudinal control to preclude disruptive lane changes of human-driven vehicles in mixed-flow traffic
Yongyang Liu ... Srinivas Peeta
Control Engineering Practice | VOL. 136
24 Apr 2023
