Online Learning in Iterated Prisoner’s Dilemma to Mimic Human Behavior

Baihan Lin,Djallel Bouneffouf,Guillermo Cecchi

doi:10.1007/978-3-031-20868-3_10

Abstract

AbstractAs an important psychological and social experiment, the Iterated Prisoner’s Dilemma (IPD) treats the choice to cooperate or defect as an atomic action. We propose to study the behaviors of online learning algorithms in the Iterated Prisoner’s Dilemma (IPD) game, where we investigate the full spectrum of reinforcement learning agents: multi-armed bandits, contextual bandits and reinforcement learning. We evaluate them based on a tournament of iterated prisoner’s dilemma where multiple agents can compete in a sequential fashion. This allows us to analyze the dynamics of policies learned by multiple self-interested independent reward-driven agents, and also allows us study the capacity of these algorithms to fit the human behaviors. Results suggest that considering the current situation to make decision is the worst in this kind of social dilemma game. Multiples discoveries on online learning behaviors and clinical validations are stated, as an effort to connect artificial intelligence algorithms with human behaviors and their abnormal states in neuropsychiatric conditions.KeywordsOnline learningBanditsContextual banditsReinforcement learningIterated Prisoner’s DilemmaBehavioral modeling

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Online Learning in Iterated Prisoner’s Dilemma to Mimic Human Behavior

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2022
Citations: 10	License type: other-oa

Similar Papers

Evolution of cooperative behavior in a spatial iterated prisoner's dilemma game with different representation schemes of game strategies
Hisao Ishibuchi ... Hiroyuki Ohyanagi
-
Hisao Ishibuchi, et. al.Hisao Ishibuchi ... Hiroyuki Ohyanagi
01 Aug 2009
01 Aug 2009

The resilience of cooperation in a Dilemma game played by reinforcement learning agents
Koichi Moriyama ... Nobuhiro Inuzuka
-
Koichi Moriyama, et. al.Koichi Moriyama ... Nobuhiro Inuzuka
01 Jul 2017
01 Jul 2017

Cooperation in the evolutionary iterated prisoner’s dilemma game with risk attitude adaptation
Weijun Zeng ... Fuzan Chen
Applied Soft Computing | VOL. 44
Weijun Zeng, et. al.Weijun Zeng ... Fuzan Chen
14 Apr 2016
Applied Soft Computing | VOL. 44

Evolution of cooperative behavior among heterogeneous agents with different strategy representations in an iterated prisoner’s dilemma game
Hiroyuki Ohyanagi ... Yusuke Nakashima
Artificial Life and Robotics | VOL. 14
Hiroyuki Ohyanagi, et. al.Hiroyuki Ohyanagi ... Yusuke Nakashima
01 Dec 2009
Artificial Life and Robotics | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online Learning in Iterated Prisoner’s Dilemma to Mimic Human Behavior

Abstract

Talk to us

Similar Papers