DQN regenerative braking control strategy based on adaptive weight coefficients

Yanli Yin,Fuzhen Wang,Shenpeng Ma,Xinxin Zhang,Sen Zhan,Xuejiang Huang

doi:10.1177/09544070231186200

Yanli Yin, Fuzhen Wang + Show 4 more

https://doi.org/10.1177/09544070231186200

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Aiming at the problems existing in regenerative braking control strategy based on Q-learning which include the dimensional disaster of state and action variables discretization and the return function weight coefficient determined empirically. This paper proposes deep Q-learning network (DQN) regenerative braking control strategy based on adaptive weight coefficients. Firstly, braking performance evaluation indexes are determined which are braking energy recovery efficiency and braking stability coefficient. Then, the state and action variables and return function are constructed respectively. Therein the braking demand power and state of charge ( SOC) are taken as state variables, braking torque proportional coefficient, and weight coefficients are taken as action variables. And return function is formulated by trading off braking energy recovery efficiency and braking stability. Finally, using the MATLAB/Simulink software, the simulation model of real working condition in Yubei district of Chongqing is established. The simulation results show that braking recovery efficiency of the proposed strategy is 7.4% higher than that of Q-learning strategy, and the average braking stability coefficient is decreased by 0.08. The results indicate the proposed strategy can better balance between braking energy recovery efficiency and braking stability than the conventional strategy.

Full Text