Emergence of exploitation as symmetry breaking in iterated prisoner's dilemma

Yuma Fujimoto, Kunihiko Kaneko

doi:10.1103/physrevresearch.1.033077

Open Access

https://doi.org/10.1103/physrevresearch.1.033077

Abstract

In society, mutual cooperation, defection, and asymmetric exploitative relationships are common. Whereas cooperation and defection have been studied extensively in the game theory literature, asymmetric exploitative relationships between players remain little explored. In a recent study, Press and Dyson demonstrated that if only one player can learn about the other, asymmetric exploitation is achieved in the prisoner's dilemma game. However, it is unknown whether such one-way exploitation is stably established when both players learn about each other symmetrically and try to optimize their payoffs. Here, we first formulate a dynamical system that describes the change in a player's probabilistic strategy under reinforcement learning to obtain greater payoffs, based on the recognition of the other player. By applying this formulation to the standard prisoner's dilemma game, we numerically and analytically demonstrate that an exploitative relationship can be achieved despite symmetric strategy dynamics and symmetric rules of the game. This exploitative relationship is stable, even though the exploited player, who receives a lower payoff than the exploiting player, has optimized its own strategy. Whether the final equilibrium state is mutual cooperation, defection, or exploitation depends crucially on the initial conditions: punishment against a defector oscillates between the players, and thus a complicated basin structure to the final equilibrium appears. In other words, slight differences in the initial state may lead to drastic changes in the final state. Considering the generality of the result, this study provides a new perspective on the origin of exploitation in society.

Highlights

Equality is not achieved in society; instead, inequality among individuals is common
In Sec. IV, we investigate the dynamics of the learning process in depth to demonstrate that a slight difference in initial strategies between the players is amplified, and this symmetry breaking results in a large payoff difference, i.e., exploitation
We emphasize that the equilibrium state for a repeated game is denoted by the subscript e, but it is unrelated to the equilibrium of learning dynamics discussed in the following subsection

Summary

INTRODUCTION

Equality is not achieved in society; instead, inequality among individuals is common. With the change in strategies of the players through learning, we check whether “symmetry breaking” can occur when individuals have symmetric capacities and environmental conditions. For this analysis, we adopt the celebrated prisoner’s dilemma game. In this game, the emergence and sustainability of cooperation, even though defection is any individual player’s best choice, have been extensively investigated [1,2]. We study the prisoner’s dilemma (PD) game (see Fig. 1 for the payoff matrix), in which each of two players, referred to as players 1 and 2, chooses to cooperate (C) or defect (D).
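The structure of the one-shot PD game described above can be illustrated with a minimal Python sketch. The payoff values (T, R, P, S) = (5, 3, 1, 0) are the conventional textbook choice and are an assumption here, not values taken from Fig. 1; the names `PAYOFF`, `defection_dominates`, and `expected_payoffs` are hypothetical helpers introduced only for illustration.

```python
# Conventional PD payoffs (assumed for illustration, not taken from Fig. 1).
# They satisfy the usual ordering T > R > P > S.
T, R, P, S = 5, 3, 1, 0

# PAYOFF[(a1, a2)] = (payoff to player 1, payoff to player 2)
PAYOFF = {
    ("C", "C"): (R, R),
    ("C", "D"): (S, T),
    ("D", "C"): (T, S),
    ("D", "D"): (P, P),
}


def defection_dominates():
    """Return True if D pays player 1 strictly more than C against either action."""
    return all(PAYOFF[("D", other)][0] > PAYOFF[("C", other)][0] for other in "CD")


def expected_payoffs(x1, x2):
    """Expected one-shot payoffs when player i cooperates with probability xi."""
    probs = {
        ("C", "C"): x1 * x2,
        ("C", "D"): x1 * (1 - x2),
        ("D", "C"): (1 - x1) * x2,
        ("D", "D"): (1 - x1) * (1 - x2),
    }
    u1 = sum(p * PAYOFF[actions][0] for actions, p in probs.items())
    u2 = sum(p * PAYOFF[actions][1] for actions, p in probs.items())
    return u1, u2
```

Under these assumed values, `defection_dominates()` holds while `expected_payoffs(1, 1)` exceeds `expected_payoffs(0, 0)` for both players, which is the dilemma. Unequal cooperation probabilities, such as `expected_payoffs(1, 0)`, give the one-shot analogue of an asymmetric payoff split.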

Repeated game for fixed strategies

Learning dynamics of strategies

Intuitive interpretation of the model

ANALYSIS OF LEARNING EQUILIBRIUM

Characterization of the exploitative relationship

TRANSIENT DYNAMICS TO THE LEARNING EQUILIBRIUM

Characterization of transient dynamics

Basin structure for exploitative state

SUMMARY AND DISCUSSION

Journal: Physical Review Research
Publication Date: Nov 5, 2019
Citations: 12
License type: CC BY 4.0
