Habits, action sequences and reinforcement learning

Amir Dezfouli,Bernard W Balleine

doi:10.1111/j.1460-9568.2012.08050.x

Amir Dezfouli, Bernard W Balleine

Open Access

https://doi.org/10.1111/j.1460-9568.2012.08050.x

Copy DOI

Abstract

It is now widely accepted that instrumental actions can be either goal-directed or habitual; whereas the former are rapidly acquired and regulated by their outcome, the latter are reflexive, elicited by antecedent stimuli rather than their consequences. Model-based reinforcement learning (RL) provides an elegant description of goal-directed action. Through exposure to states, actions and rewards, the agent rapidly constructs a model of the world and can choose an appropriate action based on quite abstract changes in environmental and evaluative demands. This model is powerful but has a problem explaining the development of habitual actions. To account for habits, theorists have argued that another action controller is required, called model-free RL, that does not form a model of the world but rather caches action values within states allowing a state to select an action based on its reward history rather than its consequences. Nevertheless, there are persistent problems with important predictions from the model; most notably the failure of model-free RL correctly to predict the insensitivity of habitual actions to changes in the action-reward contingency. Here, we suggest that introducing model-free RL in instrumental conditioning is unnecessary, and demonstrate that reconceptualizing habits as action sequences allows model-based RL to be applied to both goal-directed and habitual actions in a manner consistent with what real animals do. This approach has significant implications for the way habits are currently investigated and generates new experimental predictions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Habits, action sequences and reinforcement learning

Abstract

Talk to us

Similar Papers

More From: European Journal of Neuroscience

Lead the way for us

Journal: European Journal of Neuroscience	Publication Date: Apr 1, 2012
Citations: 323

Similar Papers

Actions, Action Sequences and Habits: Evidence That Goal-Directed and Habitual Action Control Are Hierarchically Organized
Amir Dezfouli ... Bernard W Balleine
PLoS Computational Biology | VOL. 9
Amir Dezfouli, et. al.Amir Dezfouli ... Bernard W Balleine
05 Dec 2013
PLoS Computational Biology | VOL. 9

Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems
Cheng Gao ... Dan Wang
Journal of Building Engineering | VOL. 74
Cheng Gao, et. al.Cheng Gao ... Dan Wang
01 Sep 2023
Journal of Building Engineering | VOL. 74

Accelerating Model-Free Reinforcement Learning With Imperfect Model Knowledge in Dynamic Spectrum Access
Lianjun Li ... Hao-Hsuan Chang
IEEE Internet of Things Journal | VOL. 7
Lianjun Li, et. al.Lianjun Li ... Hao-Hsuan Chang
01 Aug 2020
IEEE Internet of Things Journal | VOL. 7

Frontoparietal network activity during model-based reinforcement learning updates is reduced among adolescents with severe sexual abuse
Allison M Letkiewicz ... Josh M Cisler
Journal of Psychiatric Research | VOL. 145
Allison M Letkiewicz, et. al.Allison M Letkiewicz ... Josh M Cisler
04 Nov 2020
Journal of Psychiatric Research | VOL. 145

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Habits, action sequences and reinforcement learning

Abstract

Talk to us

Similar Papers

More From: European Journal of Neuroscience