‘I don’t want to play with you anymore’: dynamic partner judgements in moody reinforcement learners playing the prisoner’s dilemma

Grace Feehan, Shaheen Fatima

doi:10.1017/s0269888924000018

Abstract

Emerging reinforcement learning algorithms that utilize human traits as part of their conceptual architecture have been demonstrated to encourage cooperation in social dilemmas when compared to their unaltered origins. In particular, the addition of a mood mechanism facilitates more cooperative behaviour in multi-agent iterated prisoner's dilemma (IPD) games, in both static and dynamic network contexts. Mood-altered agents also exhibit humanlike behavioural trends when environmental aspects of the dilemma are altered, such as the structure of the payoff matrix used. It is possible that other environmental effects from both human and agent-based research will interact with moody structures in previously unstudied ways. As the literature on these interactions is currently small, we seek to expand on previous research by introducing two more environmental dimensions: voluntary interaction in dynamic networks, and stability of interaction through varied network restructuring. Starting from an initial Erdős–Rényi random network, we manipulate the structure of a network IPD according to existing methodology in human-based research, to investigate possible replication of its findings. We also facilitate strategic selection of opponents by introducing two partner evaluation mechanisms and testing two selection thresholds for each. We find that even minimally strategic play termination in dynamic networks is enough to enhance cooperation above a static level, though the thresholds for these strategic decisions are critical to desired outcomes. More forgiving thresholds lead to better maintenance of cooperation between kinder strategies than stricter ones, despite overall cooperation levels being relatively low. Additionally, moody reinforcement learning combined with certain play termination decision strategies can mimic trends in human cooperation affected by structural changes to the IPD played on dynamic networks, as can kind and simplistic strategies such as Tit-For-Tat. The implications of this in comparison with human data are discussed, and suggestions for diversification of further testing are made.
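The setting described in the abstract can be illustrated with a small sketch: agents play an IPD along the edges of an Erdős–Rényi random network, and after each round either side of a link may end the partnership if the partner's observed cooperation rate falls below a threshold. This is only an illustration under stated assumptions, not the authors' method. It uses fixed strategies (Tit-For-Tat and always-defect) in place of moody reinforcement learners, the conventional payoff values 5/3/1/0, and a cooperation-rate evaluation rule. The function names, parameters, and population layout are all hypothetical.

```python
import random
from itertools import combinations

# Conventional prisoner's dilemma payoffs (an assumption; the abstract gives no values).
PAYOFF = {("C", "C"): (3, 3), ("C", "D"): (0, 5), ("D", "C"): (5, 0), ("D", "D"): (1, 1)}


def erdos_renyi_edges(n, p, rng):
    """Edge set of a G(n, p) random graph: each pair is linked with probability p."""
    return {(i, j) for i, j in combinations(range(n), 2) if rng.random() < p}


def tit_for_tat(last_partner_move):
    """Cooperate on first contact, then copy the partner's previous move."""
    return last_partner_move or "C"


def always_defect(last_partner_move):
    return "D"


def run(n=20, p=0.3, rounds=10, threshold=0.5, seed=0):
    rng = random.Random(seed)
    edges = erdos_renyi_edges(n, p, rng)
    # Hypothetical population: even-numbered agents play Tit-For-Tat, odd ones defect.
    strategy = {i: tit_for_tat if i % 2 == 0 else always_defect for i in range(n)}
    last = {}    # last[(a, b)]: the move a most recently played against b
    seen = {}    # seen[(a, b)]: [cooperations, interactions] that a has observed from b
    scores = {i: 0 for i in range(n)}
    coop_rates = []

    for _ in range(rounds):
        moves = coops = 0
        for a, b in sorted(edges):
            move_a = strategy[a](last.get((b, a)))
            move_b = strategy[b](last.get((a, b)))
            last[(a, b)], last[(b, a)] = move_a, move_b
            pay_a, pay_b = PAYOFF[(move_a, move_b)]
            scores[a] += pay_a
            scores[b] += pay_b
            for observer, partner, partner_move in ((a, b, move_b), (b, a, move_a)):
                record = seen.setdefault((observer, partner), [0, 0])
                record[0] += partner_move == "C"
                record[1] += 1
            moves += 2
            coops += (move_a == "C") + (move_b == "C")
        coop_rates.append(coops / moves if moves else 0.0)
        # Play termination: a link survives only if both sides rate the other
        # at or above the threshold.
        edges = {
            (a, b) for a, b in edges
            if seen[(a, b)][0] / seen[(a, b)][1] >= threshold
            and seen[(b, a)][0] / seen[(b, a)][1] >= threshold
        }
    return coop_rates, edges, scores
```

Calling `run(threshold=0.0)` keeps every link and so approximates a static network, while a positive threshold lets agents drop uncooperative partners. Comparing the two per-round cooperation rates gives a rough feel for the static versus dynamic contrast the abstract reports, though nothing here reproduces the paper's results.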

Journal: The Knowledge Engineering Review
Publication Date: Jan 1, 2024
License type: CC BY 4.0

Similar Papers

Experimental and theoretical investigations of the emergence and sustenance of prosocial behavior in groups
Katrin Fehl
20 Feb 2022

Human cooperation in social dilemmas: comparing the Snowdrift game with the Prisoner's Dilemma
Rolf Kümmerli ... Flavien Russier
Proceedings of the Royal Society B: Biological Sciences | VOL. 274
25 Sep 2007

Spatial evolutionary game theory: Hawks and Doves revisited
...
Proceedings of the Royal Society of London. Series B: Biological Sciences | VOL. 263
22 Sep 1996

Evolution of cooperative behavior in a spatial iterated prisoner's dilemma game with different representation schemes of game strategies
Hisao Ishibuchi ... Hiroyuki Ohyanagi
01 Aug 2009
