Bilevel programming methods for computing single-leader-multi-follower equilibria in normal-form and polymatrix games

Nicola Basilico,Stefano Coniglio,Alberto Marchesi,Nicola Gatti

doi:10.1007/s13675-019-00114-8

Nicola Basilico, Stefano Coniglio + Show 2 more

Open Access

https://doi.org/10.1007/s13675-019-00114-8

Copy DOI

Abstract

The concept of leader-follower (or Stackelberg) equilibrium plays a central role in a number of real-world applications bordering on mathematical optimization and game theory. While the single-follower case has been investigated since the inception of bilevel programming with the seminal work of von Stackelberg, results for the case with multiple followers are only sporadic and not many computationally affordable methods are available. In this work, we consider Stackelberg games with two or more followers who play a (pure or mixed) Nash equilibrium once the leader has committed to a (pure or mixed) strategy, focusing on normal-form and polymatrix games. As customary in bilevel programming, we address the two extreme cases where, if the leader’s commitment originates more Nash equilibria in the followers’ game, one which either maximizes (optimistic case) or minimizes (pessimistic case) the leader’s utility is selected. First, we show that, in both cases and when assuming mixed strategies, the optimization problem associated with the search problem of finding a Stackelberg equilibrium is mathcal {NP}-hard and not in Poly-mathcal {APX} unless mathcal {P} = mathcal {NP}. We then consider different situations based on whether the leader or the followers can play mixed strategies or are restricted to pure strategies only, proposing exact nonconvex mathematical programming formulations for the optimistic case for normal-form and polymatrix games. For the pessimistic problem, which cannot be tackled with a (single-level) mathematical programming formulation, we propose a heuristic black-box algorithm. All the methods and formulations that we propose are thoroughly evaluated computationally.

Highlights

Leader-follower games model the interaction between rational agents when a hierarchical decision-making structure is in place
We focus on the fundamental case of single-leader multi-follower games with a finite number of actions per player where the overall game can be represented as a normal-form or polymatrix game—the latter is of interest as it plays an important role in a number of applications such as in the security domain, where the defender may need to optimize against multiple uncoordinated attackers solely interested in damaging the leader
For the problem of computing an O/P-leader-follower Nash equilibrium (LFNE), we show the following result: Proposition 3 The optimization problem associated with the search problem of computing an O/P-LFNE in the leader in mixed and followers in mixed (LMFM) and leader in pure and followers in mixed (LPFM) cases is N P-hard and it is not in Poly-APX unless P = N P, even when the game is polymatrix

Summary

Introduction

Leader-follower (or Stackelberg) games model the interaction between rational agents (or players) when a hierarchical decision-making structure is in place. The number of real-world problems where a leader-follower (or Stackelberg) structure can be identified is extremely large This is often the case in the security domain (An et al 2011; Kiekintveld et al 2009), where a defender, aiming to protect a set of valuable targets from the attackers, plays first, while the attackers, acting as followers, make their move only after observing the leader’s defensive strategy. We refer to an equilibrium in such games as leader-follower Nash equilibrium (LFNE) As it is typical in bilevel programming, we study two extreme cases: the optimistic one where, if the leader’s commitment originates more NE in the followers’ game, one which maximizes the leader’s utility is selected, and the pessimistic one where an equilibrium which minimizes the leader’s utility is chosen. We conclude by providing a thorough experimental evaluation of our techniques on a (normal-form and polymatrix) test bed generated with GAMUT (Nudelman et al 2004), encompassing some structured games, employing different solvers: BARON, SCIP, CPLEX, SNOPT and RBFOpt. (The latter is used for black-box optimization)

Notation

Previous works

Problem statements

Bilevel programming perspective

Complexity results

O-NF-LMFM-I

O-NF-LMFM-II

O-NF-LMFM-III

O-PM-LMFM-I

O-PM-LMFM-III

Exact formulations for NF and PF games

O-NF-LPFM-III

O-PM-LPFM-III

A note on solution approaches for the remaining cases

Computational results

The optimality gap is defined as min

10-2 MinimumEffortGame RandomGraphicalGame TravelersDilemma

Findings

10 Conclusions and future work

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURO Journal on Computational Optimization	Publication Date: May 18, 2019
Citations: 17	License type: open-access

R Discovery Prime

R Discovery Prime

Bilevel programming methods for computing single-leader-multi-follower equilibria in normal-form and polymatrix games

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURO Journal on Computational Optimization

Lead the way for us

Similar Papers

Modelling social evolution: the relative merits and limitations of a Hamilton's rule-based approach
T Wenseleers
Journal of Evolutionary Biology | VOL. 19
T WenseleersT Wenseleers
01 Sep 2006
Journal of Evolutionary Biology | VOL. 19

On the Convergence of Fictitious Play in Channel Selection Games
Victor Quintero-Florez ... Samir Medina Perlaza
IEEE Latin America Transactions | VOL. 9
Victor Quintero-Florez, et. al.Victor Quintero-Florez ... Samir Medina Perlaza
01 Jul 2011
IEEE Latin America Transactions | VOL. 9

On the fictitious play and channel selection games
S Lasaulce ... V Quintero-Florez
-
S Lasaulce, et. al.S Lasaulce ... V Quintero-Florez
01 Sep 2010
01 Sep 2010

On minmax theorems for multiplayer games
...
-
, et. al. ...
23 Jan 2011
23 Jan 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bilevel programming methods for computing single-leader-multi-follower equilibria in normal-form and polymatrix games

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURO Journal on Computational Optimization