Energy management strategies (EMSs) ease mileage anxiety and improve the health performance of battery electric vehicles (BEVs). This study proposes a soft actor-critic (SAC)-based EMS for dual-motor two-speed BEV to minimize electrical energy consumption and battery capacity losses. In addition, two optimization tricks are employed to optimize the original SAC. The linear mapping trick is integrated into the actor-network of SAC to enable agents to search for optimal EMS in a discrete (driving mode)-continuous (torque distribution) hybrid action space. The SAC-based EMS is modeled as a partially observable Markov decision process (POMDP) to obtain more information about the real state of the environment. Based on this, another optimization trick, the long short-term memory (LSTM) network, is integrated into the actor and critic of SAC to make full use of both historical and current environmental information. Simulation results show that the proposed method learns the optimal EMS, its converged episodes are shortened to the 97th, the health performance of the battery is the closest to the dynamic programming (DP)-based EMS, and maintains battery operating at a range of 30–40°C. In addition, the proposed EMS has the best adaptability in test cycles, reaching 99.42% and 99.29% of DP-based EMS, respectively.
Read full abstract