Abstract
Vehicle platooning is expected to become the dominant driving mode on future roads. To the best of our knowledge, few reinforcement learning (RL) algorithms have been applied to vehicle platoon control, which involves large-scale action and state spaces. Most existing RL-based methods address single-agent problems; multiagent problems call for multiagent RL algorithms, because the parameter space grows exponentially with the number of agents involved. However, previous multiagent RL algorithms generally provide redundant information to agents, that is, a large amount of useless or unrelated information, which can hinder training convergence and the extraction of patterns from shared information. In addition, random actions often cause crashes, especially at the beginning of training. In this study, a communication proximal policy optimization (CommPPO) algorithm is proposed to tackle these issues. Specifically, the CommPPO model adopts a parameter-sharing structure that allows the number of agents to vary dynamically, so it handles platoon dynamics such as splitting and merging well. The communication protocol of CommPPO consists of two parts. In the state part, the widely used predecessor-leader follower topology is adopted to transmit global and local state information to the agents. In the reward part, a new reward communication channel is proposed to solve the spurious-reward and "lazy agent" problems found in some existing multiagent RL algorithms. Moreover, a curriculum learning approach is adopted to reduce crashes and speed up training. To validate the proposed strategy for platoon control, two existing multiagent RL algorithms and a traditional platoon control strategy were applied in the same scenarios for comparison. Results show that CommPPO gained higher rewards and achieved the largest fuel-consumption reduction (11.6%).
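As a rough illustration of the two communication channels described in the abstract, the Python sketch below shows how a predecessor-leader follower (PLF) observation and a follower-mixing reward channel might be wired into a parameter-sharing policy. It is not the authors' implementation: every function name, the linear stand-in for the PPO actor, the 2-D vehicle states, and the reward-mixing weight are assumptions made for this example.

```python
# Illustrative sketch only, not the paper's code. Shows (1) PLF state
# communication, (2) a reward channel that mixes in the follower's
# reward, and (3) one shared policy evaluated for every vehicle.
import numpy as np


def build_observation(leader, predecessor, ego):
    """PLF state: each follower sees global leader information plus
    local predecessor information, concatenated with its own state."""
    return np.concatenate([leader, predecessor, ego])


def channel_rewards(local_rewards, weight=0.5):
    """Reward channel (assumed form): each vehicle's training reward
    mixes its own local reward with its immediate follower's, so an
    agent cannot score well while degrading the vehicle behind it,
    which is one way to counter spurious rewards and "lazy agents".
    The mixing weight of 0.5 is an assumption for illustration."""
    n = len(local_rewards)
    mixed = []
    for i in range(n):
        follower = local_rewards[i + 1] if i + 1 < n else local_rewards[i]
        mixed.append((1 - weight) * local_rewards[i] + weight * follower)
    return mixed


class SharedPolicy:
    """One set of weights evaluated for every follower, so the platoon
    can grow or shrink (merge/split) without changing the network.
    A deterministic linear map stands in for the PPO actor here."""

    def __init__(self, obs_dim, rng=np.random.default_rng(0)):
        self.w = rng.normal(scale=0.1, size=obs_dim)

    def act(self, obs):
        # Bounded acceleration command in [-1, 1].
        return float(np.tanh(self.w @ obs))


# Usage: a 4-vehicle platoon (leader + 3 followers) with invented
# 2-D states (speed, gap); obs_dim = leader(2) + predecessor(2) + ego(2).
leader = np.array([20.0, 0.0])
states = [np.array([19.5, 8.0]), np.array([19.0, 7.5]), np.array([18.5, 9.0])]
policy = SharedPolicy(obs_dim=6)
for i, ego in enumerate(states):
    pred = leader if i == 0 else states[i - 1]
    obs = build_observation(leader, pred, ego)
    print(f"follower {i}: accel command = {policy.act(obs):+.3f}")
print("mixed rewards:", channel_rewards([1.0, 0.2, -0.5]))
```

Because the same `SharedPolicy` weights serve every follower, adding or removing a vehicle only changes how many times the policy is queried, which is consistent with the abstract's claim that parameter sharing accommodates platoon splitting and merging.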