Abstract

Reinforcement learning (RL) based techniques have been employed for the tracking and adaptive cruise control of a small-scale vehicle, with the aim of transferring the obtained knowledge to a full-scale intelligent vehicle in the near future. Unlike most other control techniques, the purpose of this study is to seek a practical method that enables the vehicle to learn its control behavior on its own, in the real environment and in real time, while adapting to changing circumstances. In this context, it is necessary to design an algorithm that gives symmetrical consideration to both time efficiency and accuracy. Meanwhile, to realize adaptive cruise control specifically, a set of symmetrical control actions consisting of steering angle and vehicle speed needs to be optimized simultaneously. In this paper, the experimental setup of the small-scale intelligent vehicle is introduced first. Subsequently, three model-free RL algorithms are developed and compared to form a strategy that keeps the vehicle within its lane at a constant maximum velocity. Furthermore, a model-based RL strategy is compared that combines learning from real experience with planning from simulated experience. Finally, a Q-learning based adaptive cruise control strategy is integrated into the existing tracking control architecture to allow the vehicle to slow down in curves and accelerate on straightaways. The experimental results show that the Q-learning and Sarsa(λ) algorithms achieve better tracking behavior than conventional Sarsa, and that Q-learning outperforms Sarsa(λ) in terms of computational complexity. The Dyna-Q method performs similarly to Sarsa(λ), but with a significant reduction in computation time. Compared with a fine-tuned proportional-integral-derivative (PID) controller, the well-balanced Q-learning approach is seen to perform better, and it can also be easily applied to control problems with more than one control action.
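The core algorithmic ideas summarized above are tabular Q-learning over a joint (steering angle, speed) action set and Dyna-Q planning from a learned model. The following is a minimal Python sketch of both, not the authors' implementation: the discretized action sets, hyperparameters, and state representation are illustrative assumptions, and the Dyna-Q planning loop follows the standard textbook form.

```python
# Minimal sketch (not the paper's code) of tabular Q-learning with a joint
# (steering, speed) action space, plus an optional Dyna-Q planning loop.
# Action sets and hyperparameters below are illustrative assumptions.
import random
from collections import defaultdict

STEER_ANGLES = [-20, -10, 0, 10, 20]      # assumed steering choices (degrees)
SPEEDS = [0.5, 1.0, 1.5]                  # assumed speed setpoints (m/s)
ACTIONS = [(a, v) for a in STEER_ANGLES for v in SPEEDS]  # joint action set

ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1    # assumed learning hyperparameters
Q = defaultdict(float)                    # Q[(state, action)] -> value
model = {}                                # Dyna-Q model: (s, a) -> (r, s')

def epsilon_greedy(state):
    """Pick a joint (steering, speed) action, mostly greedily."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(state, a)])

def q_update(s, a, r, s_next):
    """One-step Q-learning backup toward the greedy successor value."""
    best_next = max(Q[(s_next, a_next)] for a_next in ACTIONS)
    Q[(s, a)] += ALPHA * (r + GAMMA * best_next - Q[(s, a)])

def dyna_q_step(s, a, r, s_next, planning_steps=10):
    """Learn from real experience, then plan from simulated experience."""
    q_update(s, a, r, s_next)             # direct RL on the real transition
    model[(s, a)] = (r, s_next)           # remember the transition
    for _ in range(planning_steps):       # replay random remembered transitions
        (ps, pa), (pr, ps_next) = random.choice(list(model.items()))
        q_update(ps, pa, pr, ps_next)
```

In this sketch the joint action tuple is what lets a single Q-table optimize steering angle and vehicle speed simultaneously, and the planning loop is what allows Dyna-Q to reach Sarsa(λ)-like tracking quality with fewer real-time interactions.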

Highlights

  • Self-driving vehicles—which incorporate multiple complex systems to sense the surrounding environment, plan a path to a destination, and control steering and speed—have grown rapidly in the last few years [1,2]

  • The experimental results show that the Q-learning and Sarsa (λ) algorithms achieve better tracking behavior than conventional Sarsa, and Q-learning outperforms Sarsa (λ) in terms of computational complexity

  • Compared with a fine-tuned proportional-integral-derivative (PID) controller, the well-balanced Q-learning approach performs better and can be applied to control problems with more than one control action

Introduction

Self-driving vehicles—which incorporate multiple complex systems to sense the surrounding environment, plan a path to a destination, and control steering and speed—have grown rapidly in the last few years [1,2]. One barrier for academics and industry who wish to develop and test their intelligent control algorithms is the massive expense of full-scale vehicles [4], not to mention the cost of constructing a test site that provides a safe, controlled environment for testing self-driving vehicles (for example, the University of Michigan spent $10 million developing an entire 32-acre mock city, Mcity, to serve as a proving ground for its intelligent vehicles [5]).
