Abstract
The research on autonomous driving based on deep reinforcement learning algorithms is a research hotspot. Traditional autonomous driving requires human involvement, and the autonomous driving algorithms based on supervised learning must be trained in advance using human experience. To deal with autonomous driving problems, this paper proposes an improved end-to-end deep deterministic policy gradient (DDPG) algorithm based on the convolutional block attention mechanism, and it is called multi-input attention prioritized deep deterministic policy gradient algorithm (MAPDDPG). Both the actor network and the critic network of the model have the same structure with symmetry. Meanwhile, the attention mechanism is introduced to help the vehicles focus on useful environmental information. The experiments are conducted in the open racing car simulator (TORCS)and the results of five experiment runs on the test tracks are averaged to obtain the final result. Compared with the state-of-the-art algorithm, the maximum reward increases from 62,207 to 116,347, and the average speed increases from 135 km/h to 193 km/h, while the number of success episodes to complete a circle increases from 96 to 147. Also, the variance of the distance from the vehicle to the center of the road is compared, and the result indicates that the variance of the DDPG is 0.6 m while that of the MAPDDPG is only 0.2 m. The above results indicate that the proposed MAPDDPG achieves excellent performance.
Highlights
With the rapid development of artificial intelligence technology, the era of automatic driving has come
Considering the shortcomings of previous autonomous driving research and the characteristics of attention mechanism, this paper proposes MAPDDPG that incorporates convolutional block attention [25] module into the policy gradient algorithm [23] to solve the problem of complex and costly manual decision-making processes
A convolutional block attention mechanism is introduced in this paper to extract channel and spatial features of images to make the model focus on the information of important regions
Summary
With the rapid development of artificial intelligence technology, the era of automatic driving has come. Considering the shortcomings of previous autonomous driving research and the characteristics of attention mechanism, this paper proposes MAPDDPG that incorporates convolutional block attention [25] module into the policy gradient algorithm [23] to solve the problem of complex and costly manual decision-making processes. During the training on the open racing car simulator (TORCS), the MAPDDPG model automatically pays attention to the input information that may affect the driving. This makes the learned driving strategies more in line with human driving behavior and contribute to higher safety. Three orbits in the TORCS simulation environment are exploited to evaluate the performance of the MAPDDPG module
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have