Abstract

To address the problems of sample underutilization and unstable training when training intelligent vehicles with the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm, a TD3 algorithm based on a Composite Prioritized Experience Replay mechanism (CPR-TD3) is proposed. The method constructs two separate priorities for each experience, one based on its immediate reward value and one based on its Temporal Difference error (TD-error), and ranks the samples under each criterion. The two rankings are then combined into a composite average rank, from which the sampling priorities are recalculated, and the experiences drawn under these priorities are used to train the target network. The reward function is further improved by introducing a minimum lane-change distance and a variable headway time distance. Finally, comparison with the traditional TD3 algorithm in a highway scenario shows that the improved algorithm is effective and that CPR-TD3 improves the training efficiency of intelligent vehicles.
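The abstract does not give implementation details, so the following is a minimal sketch of the composite rank-based prioritization it describes, under assumed conventions: samples are ranked separately by immediate reward and by absolute TD-error, the two ranks are averaged, and sampling probabilities are taken proportional to the inverse composite rank raised to a prioritization exponent (the exponent `alpha` and all function names here are illustrative, not from the paper).

```python
import numpy as np

def composite_rank_priorities(rewards, td_errors, alpha=0.6):
    """Combine reward-based and TD-error-based rankings into sampling probabilities.

    rewards   : immediate reward stored with each transition
    td_errors : TD-error of each transition (absolute value is used)
    alpha     : prioritization exponent (assumed hyperparameter, not from the paper)
    """
    rewards = np.asarray(rewards, dtype=float)
    td_errors = np.abs(np.asarray(td_errors, dtype=float))
    n = len(rewards)

    # Rank each sample under both criteria (rank 1 = largest value).
    reward_rank = np.empty(n)
    reward_rank[np.argsort(-rewards)] = np.arange(1, n + 1)
    td_rank = np.empty(n)
    td_rank[np.argsort(-td_errors)] = np.arange(1, n + 1)

    # Composite priority from the average of the two ranks
    # (smaller average rank -> larger priority).
    composite_rank = (reward_rank + td_rank) / 2.0
    priorities = (1.0 / composite_rank) ** alpha
    return priorities / priorities.sum()

def sample_batch(probabilities, batch_size, rng=None):
    """Draw transition indices according to the composite priorities."""
    rng = np.random.default_rng() if rng is None else rng
    return rng.choice(len(probabilities), size=batch_size, replace=False, p=probabilities)
```

In a TD3 training loop, the indices returned by `sample_batch` would select the minibatch used to update the critic and (on delayed steps) the actor and target networks; how often the priorities are refreshed is a design choice not specified in the abstract.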
