A significant amount of society’s infrastructure can be modeled using graph structures, from electric and communication grids, to traffic networks, to social networks. Each of these domains are also susceptible to unwanted cascades, whether this be overloaded devices in the power grid or the reach of a social media post containing misinformation. The potential harm of a cascade is compounded when considering a malicious attack by an adversary that is intended to maximize the cascade impact. However, by exploiting knowledge of the cascading dynamics, targets with the largest cascading impact can be preemptively prioritized for defense, and the damage an adversary can inflict can be mitigated. While game theory provides tools for finding an optimal preemptive defense strategy, existing methods struggle to scale to the context of large graph environments because of the combinatorial explosion of possible actions with the size of the graph. Data-driven deep learning approaches can fill this gap, but they demand a large amount of data that is often costly to obtain. We propose a novel data generation approach for the cascading failure security problem that uses counterfactual data augmentation and a sub-game data querying strategy to generate both high quality and a high quantity of training data. We demonstrate that our data generation approach creates up to an order of magnitude more data than naïve querying with double the efficiency. We also introduce a deep learning model based embedding node pairs, and train it on our generated data, showing that it effectively defends from cascades on graphs as large as 1000 nodes. Our deep learning approach achieves a closer approximation to the Nash Equilibrium than existing heuristic methods, for graphs where the ground-truth equilibrium can be calculated.
Read full abstract