The integration of renewable energy into the power grid poses significant challenges for optimization and scheduling of the power system. In recent years, methods based on deep reinforcement learning have surpassed traditional methods on the high complexity and long-term decision-making of power system optimization and scheduling. However, faced with the inherent uncertainty of renewable energy generation and the different optimization objectives in power system, the deep reinforcement learning methods are unable to effectively address them. This paper proposes a method that combines meta reinforcement learning with multi-agent reinforcement learning to solve the multi-objective two-stage robust optimization of wind/PV/thermal power system. We conducts optimization and scheduling experiments on the IEEE39 bus system. The results indicate that our method not only enhances the robustness of the scheduling strategy, but also outperforms baseline methods in terms of convergence, diversity, and uniformity of the Pareto frontier.
Read full abstract