Abstract
We study the Stackleberg variant of the repeated Sequential Prisoner's Dilemma (SPD). The game goes in two stages, and the two players, the leader and the follower, are asymmetric in both stages. In the first stage of the game, the leader chooses a strategy (for the repeated SPD of the second stage), which is immediately known to the follower. In the second stage, they play repeated SPD: In each round the follower moves after observing the leader's action. Assuming complete rationality, we find some extraordinary properties of this model. (i) The (subgame perfect) equilibrium payoff profile is unique, which lies on the corner of the region predicted by classical folk theorems: It is best for the leader and at the same time worst for the follower, (ii) the leader has simple optimal strategies that are one-step memory and stationary. These features are in great contrast with classical results, where either uniqueness cannot be guaranteed and equilibrium strategies are often quite complicated, or bounded rationality is required. Although full cooperation, i.e., the outcome is always (cooperate, cooperate), is not attainable in our model, at least a half of the optimal social welfare can be guaranteed. We also do a non-equilibrium analysis which makes the usual equilibrium analysis more convincing.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.