Computing a pessimistic Stackelberg equilibrium with multiple followers: the mixed-pure case
Computing a pessimistic Stackelberg equilibrium with multiple followers: the mixed-pure case
The search problem of computing a Stackelberg (or leader-follower) equilibrium (also referred to as an optimal strategy to commit to) has been widely investigated in the scientific literature in, almost exclusively, the single follower setting. Although the optimistic and pessimistic versions of the problem, i.e., those where the single follower breaks any ties among multiple equilibria either in favour or against the leader, are solved with different methodologies, both cases allow for efficient, polynomial-time algorithms based on linear programming. The situation is different with multiple followers, where results are only sporadic and depend strictly on the nature of the followers’ game.
In this paper, we investigate the setting of a normal-form game with a single leader and multiple followers who, after observing the leader’s commitment, play a Nash equilibrium. When both leader and followers are allowed to play mixed strategies, the corresponding search problem, both in the optimistic and pessimistic versions, is known to be inapproximable in polynomial time to within any multiplicative polynomial factor unless P = NP. Exact algorithms are known only for the optimistic case. We focus on the case where the followers play pure strategies—a restriction that applies to a number of real-world scenarios and which, in principle, makes the problem easier—under the assumption of pessimism (the optimistic version of the problem can be straightforwardly solved in polynomial time). After casting this search problem (with followers playing pure strategies) as a pessimistic bilevel programming problem, we show that, with two followers, the problem is NP-hard and, with three or more followers, it cannot be approximated in polynomial time to within any multiplicative factor which is polynomial in the size of the normal-form game, nor, assuming utilities in [0, 1], to within any constant additive loss strictly smaller than 1 unless P = NP. This shows that, differently from what happens in the optimistic version, hardness and inapproximability in the pessimistic problem are not due to the adoption of mixed strategies. We then show that the problem admits, in the general case, a supremum but not a maximum, and we propose a single-level mathematical programming reformulation which asks for the maximization of a nonconcave quadratic function over an unbounded nonconvex feasible region defined by linear and quadratic constraints. Since, due to admitting a supremum but not a maximum, only a restricted version of this formulation can be solved to optimality with state-of-the-art methods, we propose an exact ad hoc algorithm (which we also embed within a branch-and bound scheme) capable of computing the supremum of the problem and, for cases where there is no leader’s strategy where such value is attained, also an α-approximate strategy where α > 0 is an arbitrary additive loss (at most as large as the supremum). We conclude the paper by evaluating the scalability of our algorithms via computational experiments on a well-established testbed of game instances.
Coniglio, Stefano
03838248-2ce4-4dbc-a6f4-e010d6fdac67
Marchesi, Alberto
d75af98a-e386-4dc6-839b-93e68c462508
Gatti, Nicola
01e1f32c-edda-4516-8d69-7f93a3525a0d
Coniglio, Stefano
03838248-2ce4-4dbc-a6f4-e010d6fdac67
Marchesi, Alberto
d75af98a-e386-4dc6-839b-93e68c462508
Gatti, Nicola
01e1f32c-edda-4516-8d69-7f93a3525a0d
Coniglio, Stefano, Marchesi, Alberto and Gatti, Nicola
(2019)
Computing a pessimistic Stackelberg equilibrium with multiple followers: the mixed-pure case.
Algorithmica.
(doi:10.1007/s00453-019-00648-8).
Abstract
The search problem of computing a Stackelberg (or leader-follower) equilibrium (also referred to as an optimal strategy to commit to) has been widely investigated in the scientific literature in, almost exclusively, the single follower setting. Although the optimistic and pessimistic versions of the problem, i.e., those where the single follower breaks any ties among multiple equilibria either in favour or against the leader, are solved with different methodologies, both cases allow for efficient, polynomial-time algorithms based on linear programming. The situation is different with multiple followers, where results are only sporadic and depend strictly on the nature of the followers’ game.
In this paper, we investigate the setting of a normal-form game with a single leader and multiple followers who, after observing the leader’s commitment, play a Nash equilibrium. When both leader and followers are allowed to play mixed strategies, the corresponding search problem, both in the optimistic and pessimistic versions, is known to be inapproximable in polynomial time to within any multiplicative polynomial factor unless P = NP. Exact algorithms are known only for the optimistic case. We focus on the case where the followers play pure strategies—a restriction that applies to a number of real-world scenarios and which, in principle, makes the problem easier—under the assumption of pessimism (the optimistic version of the problem can be straightforwardly solved in polynomial time). After casting this search problem (with followers playing pure strategies) as a pessimistic bilevel programming problem, we show that, with two followers, the problem is NP-hard and, with three or more followers, it cannot be approximated in polynomial time to within any multiplicative factor which is polynomial in the size of the normal-form game, nor, assuming utilities in [0, 1], to within any constant additive loss strictly smaller than 1 unless P = NP. This shows that, differently from what happens in the optimistic version, hardness and inapproximability in the pessimistic problem are not due to the adoption of mixed strategies. We then show that the problem admits, in the general case, a supremum but not a maximum, and we propose a single-level mathematical programming reformulation which asks for the maximization of a nonconcave quadratic function over an unbounded nonconvex feasible region defined by linear and quadratic constraints. Since, due to admitting a supremum but not a maximum, only a restricted version of this formulation can be solved to optimality with state-of-the-art methods, we propose an exact ad hoc algorithm (which we also embed within a branch-and bound scheme) capable of computing the supremum of the problem and, for cases where there is no leader’s strategy where such value is attained, also an α-approximate strategy where α > 0 is an arbitrary additive loss (at most as large as the supremum). We conclude the paper by evaluating the scalability of our algorithms via computational experiments on a well-established testbed of game instances.
Text
paper
- Accepted Manuscript
More information
Accepted/In Press date: 5 October 2019
e-pub ahead of print date: 14 November 2019
Identifiers
Local EPrints ID: 434827
URI: http://eprints.soton.ac.uk/id/eprint/434827
ISSN: 0178-4617
PURE UUID: d5782cf3-c3ca-4b13-a350-eb7f93653608
Catalogue record
Date deposited: 11 Oct 2019 16:30
Last modified: 17 Mar 2024 03:40
Export record
Altmetrics
Contributors
Author:
Alberto Marchesi
Author:
Nicola Gatti
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics