Multi-Agent Reinforcement Learning for the Energy Optimization of Cyber-Physical Production Systems

Jupiter Bakakeu,Schirin Baer,Matthias Brossog,Joern Peschke,Joerg Franke,Hans-Henning Klos

doi:10.1007/978-3-030-61045-6_11

Abstract

This chapter proposes an artificial intelligence based solution for the efficient operation of a heterogeneous cluster of flexible manufacturing machines with energy generation and storage capabilities in an electricity micro-grid featuring a high volatility of electricity prices. The problem of finding the optimal control policy is first formulated as a game theoretic sequential decision making problem under uncertainty, where at every time step the uncertainty is characterized by future weather dependent energy prices, high demand fluctuation, as well as random unexpected disturbances on the factory floor. Because of the parallel interaction of the machines with the grid, the local viewpoints of an agent are non-stationary and non-Markovian. Therefore, traditional methods such as standard reinforcement learning approaches that learn a specialized policy for a single machine are not applicable. To address this problem, we propose a multi-agent actor-critic method that takes into account the policies of other participants to achieve explicit coordination between a large numbers of actors. We show the strength of our approach in mixed cooperative and competitive scenarios where different production machines were able to discover different coordination strategies in order to increase the energy efficiency of the whole factory floor.

Full Text