Abstract

Mutual relationships, such as cooperation and exploitation, are the basis of human and other biological societies. The foundations of these relationships are rooted in the decision making of individuals, and whether they choose to be selfish or altruistic. How individuals choose their behaviors can be analyzed using a strategy optimization process in the framework of game theory. Previous studies have shown that reference to individuals’ previous actions plays an important role in their choice of strategies and establishment of social relationships. A fundamental question remains as to whether an individual with more information can exploit another who has less information when learning the choice of strategies. Here we demonstrate that a player using a memory-one strategy, who can refer to their own previous action and that of their opponent, can be exploited by a reactive player, who only has the information of the other player, based on mutual adaptive learning. This is counterintuitive because the former has more choice in strategies and can potentially obtain a higher payoff. We demonstrated this by formulating the learning process of strategy choices to optimize the payoffs in terms of coupled replicator dynamics and applying it to the prisoner’s dilemma game. Further, we show that the player using a memory-one strategy, by referring to their previous experience, can sometimes act more generous toward the opponent’s defection, thereby accepting the opponent’s exploitation. Mainly, we found that through adaptive learning, a player with limited information usually exploits the player with more information, leading to asymmetric exploitation.

Highlights

  • Cooperation, defection, and exploitation are important relationships that universally appear in biological and social systems

  • The monumental and the numerous following studies showed that the players with TFT strategies are selected in the optimization process and establish cooperation. This strategy is classified as a reactive strategy [12,13,14,15,16,17] as the player chooses their actions by referring only to their opponent’s previous action

  • We investigated whether players using the memory-one strategy would win the game against opponents using the reactive strategy, by utilizing the extra information provided through the observation of their own previous actions

Read more

Summary

Introduction

Cooperation, defection, and exploitation are important relationships that universally appear in biological and social systems. The choice of strategy, i.e., selfish or altruistic behavior, is important in establishing social relationships Individuals, based on their abilities, sophisticate their strategies through their experiences. The monumental and the numerous following studies showed that the players with TFT strategies are selected in the optimization process and establish cooperation This strategy is classified as a reactive strategy [12,13,14,15,16,17] as the player chooses their actions by referring only to their opponent’s previous action. We investigated whether players using the memory-one strategy would win the game against opponents using the reactive strategy, by utilizing the extra information provided through the observation of their own previous actions.

Formulation of class and strategy
Analysis of repeated game
Learning dynamics of memory-one class
Detailed calculation of learning dynamics
Learning dynamics of other strategies
One-sided learning against a fixed strategy
Mutual learning between memory-one and reactive classes
Analysis of exploitation
Mechanism of one-sided exploitation: self-reference leads to generosity
Generality over different classes
Generality over different payoff matrices
Summary and discussion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.