Reports an error in "The role of memory in counterfactual valuation" by Natalie Biderman, Samuel J. Gershman and Daphna Shohamy (Journal of Experimental Psychology: General, 2023[Jun], Vol 152[6], 1754-1767). In this article, several corrections have been made to two equations, the text, and Figure 3. First, there was an error in two equations of the policy gradient model depicted in the Model Description section. The correction did not alter the main conclusion of the model, but it did change slightly the comparison between experimental conditions of each model depicted in the Results section. The correct Equation 3 and correct Equation 4 are present in the erratum. Additionally, the last four sentences in the first paragraph of the A Policy Gradient Model Captures the Memory-Based Inverse Decision Bias have been revised. Finally, the model predictions shown in the gray bars of Figure 3 were slightly modified. The online version of this article has been corrected. (The following abstract of the original article appeared in record 2023-72914-001.) Value-based decisions are often guided by past experience. If a choice led to a good outcome, we are more likely to repeat it. This basic idea is well-captured by reinforcement-learning models. However, open questions remain about how we assign value to options we did not choose and which we therefore never had the chance to learn about directly. One solution to this problem is proposed by policy gradient reinforcement-learning models; these do not require direct learning of value, instead optimizing choices according to a behavioral policy. For example, a logistic policy predicts that if a chosen option was rewarded, the unchosen option would be deemed less desirable. Here, we test the relevance of these models to human behavior and explore the role of memory in this phenomenon. We hypothesize that a policy may emerge from an associative memory trace formed during deliberation between choice options. In a preregistered study (n = 315) we show that people tend to invert the value of unchosen options relative to the outcome of chosen options, a phenomenon we term inverse decision bias. The inverse decision bias is correlated with memory for the association between choice options; moreover, it is reduced when memory formation is experimentally interfered with. Finally, we present a new memory-based policy gradient model that predicts both the inverse decision bias and its dependence on memory. Our findings point to a significant role of associative memory in valuation of unchosen options and introduce a new perspective on the interaction between decision-making, memory, and counterfactual reasoning. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
Read full abstract