Abstract

We compare the performance of two connectionist models developed to account for specific aspects of decision making in the Iterated Prisoner's Dilemma game. Both models are based on a common recurrent network architecture. The first uses a backward-oriented reinforcement learning algorithm to learn to play the game, while the second makes its move decisions based on generated predictions about future games, moves, and payoffs. Both models predict the opponent's move and the expected payoff, and both include a built-in autoassociator in their architecture aimed at a more efficient representation of the payoff matrix. The simulation results show that the model with explicit anticipation of game outcomes can reproduce the experimentally observed dependence of the cooperation rate on the so-called cooperation index, demonstrating the importance of anticipation in modeling the decision-making process of human participants. The role of the models' building blocks and mechanisms is investigated and discussed, and comparisons with experiments with human participants are presented.

Keywords: anticipation, cooperation, decision-making, recurrent artificial neural network, reinforcement learning
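To make the two mechanisms contrasted in the abstract concrete, below is a minimal illustrative sketch in Python. It uses the standard Rapoport-Chammah cooperation index, CI = (R − P) / (T − S), computed from the usual Prisoner's Dilemma payoffs (T > R > P > S), and a toy anticipation-based move rule that weighs predicted future payoffs, not only the current round. The discount factor, the reciprocity-style shift in the opponent's predicted cooperation probability, and the function names are assumptions for illustration; they are not the paper's trained recurrent-network model.

```python
GAMMA = 0.9  # assumed discount factor for anticipated future rounds


def cooperation_index(T: float, R: float, P: float, S: float) -> float:
    """Rapoport-Chammah cooperation index of a PD payoff matrix."""
    return (R - P) / (T - S)


def expected_payoff(my_move: str, p_opp_coop: float,
                    T: float, R: float, P: float, S: float) -> float:
    """Expected stage-game payoff of `my_move` ('C' or 'D'), given a
    predicted probability that the opponent cooperates."""
    if my_move == "C":
        return p_opp_coop * R + (1 - p_opp_coop) * S
    return p_opp_coop * T + (1 - p_opp_coop) * P


def anticipatory_move(p_now: float, p_after_C: float, p_after_D: float,
                      T: float, R: float, P: float, S: float,
                      gamma: float = GAMMA) -> str:
    """Pick the move with the higher anticipated long-run value.

    The continuation term sums a geometric series of future rounds in
    which the opponent's predicted cooperation probability has shifted
    to p_after_C or p_after_D depending on our current move (a crude
    hand-coded reciprocity assumption, standing in for the model's
    learned predictions of future moves and payoffs).
    """
    horizon = gamma / (1 - gamma)  # discounted weight of all future rounds
    v_c = (expected_payoff("C", p_now, T, R, P, S)
           + horizon * expected_payoff("C", p_after_C, T, R, P, S))
    v_d = (expected_payoff("D", p_now, T, R, P, S)
           + horizon * expected_payoff("D", p_after_D, T, R, P, S))
    return "C" if v_c > v_d else "D"


if __name__ == "__main__":
    T, R, P, S = 5.0, 3.0, 1.0, 0.0  # example matrix with T > R > P > S
    print(cooperation_index(T, R, P, S))                    # 0.4
    # With strong anticipated reciprocity, cooperation wins the lookahead:
    print(anticipatory_move(0.9, p_after_C=0.9, p_after_D=0.1,
                            T=T, R=R, P=P, S=S))            # 'C'
```

Note that a purely myopic expected-value maximizer would always defect in a true PD matrix (since T > R and P > S); only the anticipated future rounds can make cooperation the higher-valued move, which is the contrast the abstract draws between the backward-oriented and the anticipatory model.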

