Performance Of Reinforcement Learning Research Articles

Existing optimal strategies for fast-charging electric vehicle batteries are predominantly at the cell level. The proposed high-fidelity methods for extending available cell models to pack level are associated with a large computational burden, making real-world implementation impossible. Further, fast charging optimization and thermal management problems are dependent. That is, the cooling system reduces battery temperature, allowing for higher charging current, while optimal current minimizes the need for cooling and, in turn, reduces thermal system power consumption. There is a lack of studies where fast charging optimization and battery thermal management problems are jointly solved. Therefore, this paper proposes a simulation study using a deep reinforcement learning (RL) approach that concurrently solves fast charging and thermal management problems for a battery pack with low computational complexity. In this regard, we formulate each cell using an electro-thermal-aging model, which accounts for the heat exchange between adjacent cells. The electro-thermal-aging model plays the role of the environment for RL and is not the focus of novelty in this work. The RL agent is then trained to output the optimal charging current and coolant mass flow rate. Moreover, the proposed methodology is examined through a numerical study where the outperformance of our model is showcased by comparing it with baseline algorithms of model predictive control (MPC) and CC–CV. Consequently, three battery packs comprising 20, 444, and 7104 cells are used, respectively. We demonstrate that RL requires less than a second to finish the simulation for 20 cells, while MPC requires more than 80 min. In addition, RL keeps the cells’ core temperature below 33 °C, but MPC results reach 40 °C. The RL performance in charging packs with 444 and 7104 cells is then compared with that of CC–CV. In terms of computation time, RL and CC–CV are nearly the same, while regarding the average core and surface temperature of the cells as well as the cell aging, RL attains better outcomes, extending the battery pack’s life by up to two years after 1000 fast charging cycles.

Read full abstract

Factory layout planning aims at finding an optimized layout configuration under consideration of varying influences such as the material flow characteristics. Manual layout planning can be characterized as a complex decision-making process due to a large number of possible placement options. Automated planning approaches aim at reducing the manual planning effort by generating optimized layout variants in the early stages of layout planning. Recent developments have introduced deep Reinforcement Learning (RL) based planning approaches that allow to optimize a layout under consideration of a single optimization criterion. However, within layout planning, multiple partially conflicting planning objectives have to be considered. Such multiple objectives are not considered by existing RL-based approaches. This paper addresses this research gap by presenting a novel deep RL-based layout planning approach that allows consideration of multiple objectives for optimization. Furthermore, existing RL-based planning approaches only consider analytically formulated objectives such as the transportation distance. Consequently, dynamic influences in the material flow are neglected which can result in higher operational costs of the future factory. To address this issue, a discrete event simulation module is developed that allows simulating manufacturing and material flow processes simultaneously for any layout configuration generated by the RL approach. Consequently, the presented approach considers material flow simulation results for multi-objective optimization. To investigate the capabilities of RL-based factory layout planning, different RL architectures are compared based on a simplified application scenario. Throughput time, media supply, and material flow clarity are considered as optimization objectives. The best performing architecture is then applied to an exemplary application scenario and compared with the results obtained by a combined version of the genetic algorithm and tabu search, the non-dominated sorting genetic algorithm, and the optimal solution. Finally, two industrial planning scenarios, one focusing on brownfield and one on greenfield planning, are considered. The results show that the performance of RL compared to meta-heuristics depends on the considered computation time. With time the results generated by the RL approach exceed the quality of the best conventional solution by up to 11%. Finally, the potential of applying transfer learning is investigated for three different application scenarios. It is observed that RL can learn generalized patterns for factory layout planning, which allows to significantly reduce the required training time and can lead to improved solution quality. Thus, the use of pre-trained RL models shows a substantial performance potential for automated factory layout planning which cannot be achieved with conventional automated planning approaches.

Read full abstract

Performance Of Reinforcement Learning Research Articles

Related Topics

Articles published on Performance Of Reinforcement Learning

SCORE: Skill-Conditioned Online Reinforcement Learning

Optimizing a Dynamic Vehicle Routing Problem with Deep Reinforcement Learning: Analyzing State-Space Components

Alleviating imbalanced problems of reinforcement learning when applying in real-time power network dispatching and control

Deep reinforcement learning based fast charging and thermal management optimization of an electric vehicle battery pack

Compositional design of multicomponent alloys using reinforcement learning

Transferable multi-objective factory layout planning using simulation-based deep reinforcement learning

Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning

Solving Partially Observable 3D-Visual Tasks with Visual Radial Basis Function Network and Proximal Policy Optimization

Human-Guided Reinforcement Learning With Sim-to-Real Transfer for Autonomous Navigation.

Reinforcement Learning versus Model Predictive Control on greenhouse climate control

Reinforcement learning in robotic motion planning by combined experience-based planning and self-imitation learning

Curiosity-tuned experience replay for wargaming decision modeling without reward-engineering

Asynchronous Deep Double Dueling Q-learning for trading-signal execution in limit order book markets.

Improvement of Reinforcement Learning With Supermodularity.

Impact of network settings on reinforcement learning based caching policy in cooperative edge networks

강화학습 기반 화학 공정 제어 성능 향상을 위한 보상 함수 시뮬레이션 연구

Switching to online: Testing the validity of supervised remote testing for online reinforcement learning experiments.

Medium-term Capacity Management through Reinforcement Learning – Literature review and concept for an industrial pilot-application

Federated reinforcement learning: techniques, applications, and open challenges

Predicting Attention-Shaping Response in People With Schizophrenia.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Performance Of Reinforcement Learning Research Articles

Related Topics

Articles published on Performance Of Reinforcement Learning

SCORE: Skill-Conditioned Online Reinforcement Learning

Optimizing a Dynamic Vehicle Routing Problem with Deep Reinforcement Learning: Analyzing State-Space Components

Alleviating imbalanced problems of reinforcement learning when applying in real-time power network dispatching and control

Deep reinforcement learning based fast charging and thermal management optimization of an electric vehicle battery pack

Compositional design of multicomponent alloys using reinforcement learning

Transferable multi-objective factory layout planning using simulation-based deep reinforcement learning

Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning

Solving Partially Observable 3D-Visual Tasks with Visual Radial Basis Function Network and Proximal Policy Optimization

Human-Guided Reinforcement Learning With Sim-to-Real Transfer for Autonomous Navigation.

Reinforcement Learning versus Model Predictive Control on greenhouse climate control

Reinforcement learning in robotic motion planning by combined experience-based planning and self-imitation learning

Curiosity-tuned experience replay for wargaming decision modeling without reward-engineering

Asynchronous Deep Double Dueling Q-learning for trading-signal execution in limit order book markets.

Improvement of Reinforcement Learning With Supermodularity.

Impact of network settings on reinforcement learning based caching policy in cooperative edge networks

강화학습 기반 화학 공정 제어 성능 향상을 위한 보상 함수 시뮬레이션 연구

Switching to online: Testing the validity of supervised remote testing for online reinforcement learning experiments.

Medium-term Capacity Management through Reinforcement Learning – Literature review and concept for an industrial pilot-application

Federated reinforcement learning: techniques, applications, and open challenges

Predicting Attention-Shaping Response in People With Schizophrenia.