Abstract

In an autonomous vehicle, lane following is an important component and a basic function of autonomous driving. However, existing lane following systems have several shortcomings. First, the control methods they adopt require an accurate system model, and because different vehicles have different parameters, a large amount of parameter calibration work is needed. Second, they may fail on road sections that demand high lateral acceleration from the vehicle, such as sharp curves. Third, their decision-making systems are defined by hand-crafted rules, which brings disadvantages: the rules are difficult to formulate, human subjective factors cannot guarantee objectivity, and coverage is hard to guarantee. In recent years, the deep deterministic policy gradient (DDPG) algorithm has been widely used in the field of autonomous driving due to its strong nonlinear fitting ability and generalization performance. However, the DDPG algorithm suffers from overestimated state-action values, large cumulative errors, and low training efficiency. Therefore, this paper improves the DDPG algorithm with double critic networks and a prioritized experience replay mechanism, and then proposes a lane following method based on the improved algorithm. Experiments show that the algorithm achieves excellent following results under various road conditions.
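To illustrate the double-critic idea behind the improvement, the following is a minimal sketch, assuming a PyTorch-style setup: the TD target is built from the minimum of two target critics, which curbs the overestimation of state-action values that a single DDPG critic suffers from. The function name, network arguments, shapes, and discount factor are illustrative assumptions, not the paper's exact implementation.

```python
import torch

def double_critic_target(reward, next_state, done, actor_target,
                         critic1_target, critic2_target, gamma=0.99):
    """Compute a TD target from the minimum of two target critics.

    Taking the element-wise minimum of the two critic estimates reduces the
    overestimation bias of a single-critic DDPG update. All networks and the
    discount factor here are illustrative assumptions.
    """
    with torch.no_grad():
        next_action = actor_target(next_state)            # deterministic target policy
        q1 = critic1_target(next_state, next_action)
        q2 = critic2_target(next_state, next_action)
        # Bootstrapped target; (1 - done) masks out terminal transitions.
        target_q = reward + gamma * (1.0 - done) * torch.min(q1, q2)
    return target_q
```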

Highlights

  • Lane following is one of the most important autonomous driving subsystems

  • To solve this problem, the authors in [22] propose the deep deterministic policy gradient (DDPG) algorithm, a direct policy search method that outputs continuous action values and is therefore well suited to continuous control environments

  • The angle with the lane axis is reduced by 40% and the distance from the road centerline by 49%, indicating that the proposed algorithm handles the lane following task significantly better than the original DDPG algorithm

Summary

Introduction

Lane following is one of the most important autonomous driving subsystems. Only after the lane following function is successfully implemented can other advanced subsystems of autonomous driving, such as obstacle avoidance and car following, be further developed [1]. To solve this problem, the authors in [22] propose the DDPG algorithm, a direct policy search method that outputs continuous action values and is therefore well suited to continuous control environments. They applied it to lane following and achieved good results in the TORCS environment. However, such a project requires a lot of manpower and material resources and causes much waste of resources. To deal with this problem, this paper proposes the double critic networks and prioritized experience replay deep deterministic policy gradient (DCPER-DDPG) algorithm. The contributions are as follows: first, a lane following algorithm architecture based on deep reinforcement learning is proposed; second, the reward function, exploration strategy, and improved DDPG algorithm are designed; third, the proposed algorithm is tested and verified on the TORCS simulation platform.
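As a rough illustration of the prioritized experience replay component of DCPER-DDPG, the sketch below shows a proportional replay buffer in which transitions with larger TD error are sampled more often, which speeds up learning compared with uniform sampling. The class name, hyperparameters, and flat-array implementation are illustrative assumptions, not the paper's exact design.

```python
import numpy as np

class PrioritizedReplayBuffer:
    """Minimal proportional prioritized experience replay buffer (illustrative)."""

    def __init__(self, capacity=100_000, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha                      # how strongly priorities shape sampling
        self.storage = []
        self.priorities = np.zeros(capacity, dtype=np.float64)
        self.pos = 0

    def add(self, transition):
        # New transitions get the current maximum priority so they are replayed at least once.
        max_prio = self.priorities.max() if self.storage else 1.0
        if len(self.storage) < self.capacity:
            self.storage.append(transition)
        else:
            self.storage[self.pos] = transition
        self.priorities[self.pos] = max_prio
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        prios = self.priorities[:len(self.storage)]
        probs = prios ** self.alpha
        probs /= probs.sum()
        indices = np.random.choice(len(self.storage), batch_size, p=probs)
        # Importance-sampling weights correct the bias from non-uniform sampling.
        weights = (len(self.storage) * probs[indices]) ** (-beta)
        weights /= weights.max()
        batch = [self.storage[i] for i in indices]
        return batch, indices, weights

    def update_priorities(self, indices, td_errors, eps=1e-6):
        # Priority is proportional to the magnitude of the TD error.
        self.priorities[indices] = np.abs(td_errors) + eps
```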

The remaining sections of the paper cover the following topics:

  • Critic Network Structure
  • Reward Function
  • Exploration
  • Double Critic Networks and Priority Experience Replay of DDPG Algorithm
  • Simulation Environment
  • Termination Condition Setting
  • Training
  • Analysis of Comparative Results
  • Findings
  • Conclusions