Coordinated dispatch of integrated electricity and thermal system (IETS) provides extra operation flexibility which is further improved by integration of electrical and thermal storages. However, the problem non-convexity and multiple uncertainties hinder its optimal real-time dispatch. This paper proposes an approximate dynamic programming with imitation learning (ADP-IL) based real-time dispatch policy for IETS with electrical and thermal storages, which is computationally efficient and adaptive to uncertainties while satisfying complex networks constraints. First, real-time dispatch of IETS is reformulated as a multistage stochastic sequential optimization and non-convex terms are addressed by mixed integer programming. Next, an ADP which exploits value function monotonicity is employed to temporally decompose original problem and an off-line pre-learning is incorporated to address dimension complexity. Then, imitation learning, which allows the algorithm to learn from expert demonstration, is introduced to further accelerate off-line pre-learning. After sufficient learning, ADP-IL provides high quality real-time solution with remarkable computation efficiency. Comprehensive studies verify optimality, adaptability, efficiency, and scalability of ADP-IL.
Read full abstract