Abstract
Decision making is a fundamental component of safe autonomous driving in highway scenarios. The mainstream architecture for this task is the classical deep Q-learning network (DQN). However, two major issues remain with the DQN: 1) because of its traditional experience replay mechanism, the model tends to learn bias from imbalanced data, and 2) for multiobjective tasks, a single reward function limits the model's ability to learn representative domain knowledge. To address these problems, this article proposes a DQN model based on a prioritized experience replay (PER) mechanism with a multireward architecture (MRA) for highway driving decision making. For balanced training, the importance of each memory sample is encoded as the error between the Q estimate and the Q target. For more directed training, the single reward function is decomposed into three minor ones based on prior knowledge, emphasizing speed, overtaking, and lane changing. Experimental results indicate that the proposed prioritized MRA (PMRA) DQN outperforms the traditional DQN, achieving higher driving speeds, fewer lane changes, and safer overtaking.
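The TD-error-based prioritization the abstract describes can be sketched as follows. This is a minimal illustration of the general PER idea, not the paper's implementation: the class name, parameters (`alpha`, `eps`), and the proportional-priority scheme are assumptions.

```python
import numpy as np

class PrioritizedReplayBuffer:
    """Minimal sketch of prioritized experience replay.

    Each transition's priority is derived from the absolute TD error
    |Q_target - Q_estimate|, so samples the network predicts poorly are
    replayed more often than under uniform sampling.
    """

    def __init__(self, capacity=10000, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha      # how strongly priorities skew sampling (assumed)
        self.eps = eps          # keeps every priority strictly positive
        self.buffer = []
        self.priorities = []

    def add(self, transition, td_error):
        # Priority grows with the magnitude of the TD error.
        priority = (abs(td_error) + self.eps) ** self.alpha
        if len(self.buffer) >= self.capacity:
            self.buffer.pop(0)
            self.priorities.pop(0)
        self.buffer.append(transition)
        self.priorities.append(priority)

    def sample(self, batch_size):
        # Sample transitions proportionally to their priorities.
        probs = np.array(self.priorities)
        probs = probs / probs.sum()
        idx = np.random.choice(len(self.buffer), size=batch_size, p=probs)
        return [self.buffer[i] for i in idx], idx

    def update(self, indices, td_errors):
        # After a training step, refresh priorities with the new TD errors.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = (abs(err) + self.eps) ** self.alpha
```

In practice a transition with a large TD error (e.g. a rare near-collision during overtaking) is sampled far more often than well-learned cruising transitions, which counteracts the imbalanced-data bias mentioned above.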