Reinforcement Learning Parameters Research Articles

Major depressive disorder is prevalent and impairing. Parsing neurocomputational substrates of reinforcement learning in individuals with depression may facilitate a mechanistic understanding of the disorder and suggest new cognitive therapeutic targets. The objective of this study was to determine associations among computational model-derived reinforcement learning parameters, depression symptoms, and symptom changes after treatment. In this mixed cross-sectional-cohort study, individuals performed reward and loss variants of a probabilistic learning task during functional magnetic resonance imaging at baseline and follow-up. A volunteer sample with and without a depression diagnosis was recruited from the community. Participants were assessed from July 2011 to February 2017, and data were analyzed from May 2017 to May 2021. Computational model-based analyses of participants' choices assessed a priori hypotheses about associations between components of reward-based and loss-based learning and depression symptoms. Changes in both learning parameters and symptoms were then assessed in a subset of participants who received cognitive behavioral therapy (CBT). Of 101 included adults, 69 (68.3%) were female, and the mean (SD) age was 34.4 (11.2) years. A total of 69 participants with a depression diagnosis and 32 participants without a depression diagnosis were included at baseline; 48 participants (28 with depression who received CBT and 20 without depression) were included at follow-up (mean [SD] of 115.1 [15.6] days). Computational model-based analyses of behavioral choices and neural data identified associations of learning with symptoms during reward learning and loss learning, respectively.
During reward learning only, anhedonia (and not negative affect or arousal) was associated with model-derived learning parameters (learning rate: posterior mean regression β = -0.14; 95% credible interval [CrI], -0.12 to -0.03; outcome sensitivity: posterior mean regression β = 0.18; 95% CrI, 0.02 to 0.37) and neural learning signals (moderation of association between striatal prediction error and expected value signals: t(97) = -2.10; P = .04). During loss learning only, negative affect (and not anhedonia or arousal) was associated with learning parameters (outcome shift: posterior mean regression β = -0.11; 95% CrI, -0.20 to -0.01) and disrupted neural encoding of learning signals (association with subgenual anterior cingulate prediction error signals: r = -0.28; P = .005). Symptom improvement following CBT was associated with normalization of learning parameters that were disrupted at baseline (reward learning rate: posterior mean regression β = 0.15; 90% CrI, 0.001 to 0.41; loss outcome shift: posterior mean regression β = 0.42; 90% CrI, 0.09 to 0.77). In this study, the mapping of reinforcement learning components to symptoms of major depression revealed mechanistic features associated with these symptoms and pointed to possible learning-based therapeutic processes and targets.

Context. Machine learning is one of the actively developing areas of data processing. Reinforcement learning is a class of machine learning methods in which the problem is to map a sequence of environment states to an agent’s actions. Significant progress in this area has been achieved using DQN algorithms, which became one of the first classes of stable algorithms for learning with deep neural networks. The main disadvantage of this approach is the rapid growth in RAM usage on real-world tasks. The approach proposed in this paper can partially solve this problem. Objective. The aim is to develop a method of forming the structure of, and the pattern of access to, a sparse distributed memory with increased information content, in order to improve reinforcement learning without additional memory. Method. A method is proposed for forming and modifying the structure of a sparse distributed memory that stores the actor’s previous transitions in the form of prototypes. The method increases the informativeness of the stored data and, as a result, improves the process of building a model of the studied process by intensifying the learning of the deep neural network. The increase in informativeness results from the following sequence of actions. First, the new transition is compared with the last saved transition. To perform this comparison, the method introduces an estimate of the distance between transitions. If the distance between the new transition and the last saved transition is smaller than a specified threshold, the new transition is written in place of the previous one without increasing the amount of memory. Otherwise, a new prototype is created in memory and the prototype that has been stored in memory the longest is deleted. Results. The proposed method was evaluated on the popular “Water World” test problem. The results showed a 1.5-fold increase in the actor’s survival time in a hostile environment.
This result was achieved by increasing the informativeness of the stored data without increasing the amount of RAM. Conclusions. The proposed method of forming and modifying the structure of sparse distributed memory made it possible to increase the informativeness of the stored data. As a result, reinforcement learning parameters improved on the “Water World” problem, owing to the increased accuracy of the model of the physical process represented by a deep neural network.
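
The storage rule described in the Method part of this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the class name `PrototypeMemory`, the parameters `capacity` and `threshold`, the flat-tuple encoding of a transition, and the use of Euclidean distance are all assumptions, since the abstract does not specify them.

```python
import math
from collections import deque


class PrototypeMemory:
    """Fixed-size store of transition prototypes (hypothetical sketch)."""

    def __init__(self, capacity, threshold):
        self.capacity = capacity      # maximum number of stored prototypes
        self.threshold = threshold    # distance below which we overwrite
        self.prototypes = deque()     # oldest prototype sits on the left

    @staticmethod
    def distance(a, b):
        # Assumption: transitions are flat tuples of floats and the
        # distance estimate is Euclidean; the abstract names no metric.
        return math.dist(a, b)

    def store(self, transition):
        # Compare the new transition with the last saved one only.
        if self.prototypes and (
            self.distance(transition, self.prototypes[-1]) < self.threshold
        ):
            # Close enough: overwrite in place, memory does not grow.
            self.prototypes[-1] = transition
            return
        # Otherwise add a new prototype, evicting the longest-stored
        # one if the memory is already full.
        if len(self.prototypes) >= self.capacity:
            self.prototypes.popleft()
        self.prototypes.append(transition)


memory = PrototypeMemory(capacity=2, threshold=0.5)
for t in [(0.0, 0.0), (0.1, 0.0), (1.0, 0.0), (2.0, 0.0)]:
    memory.store(t)
stored = list(memory.prototypes)
```

In this sketch the second transition overwrites the first because the two are closer than the threshold, and the fourth transition evicts the oldest remaining prototype once the capacity of two is reached.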

Articles published on Reinforcement Learning Parameters

Regulation of reinforcement learning parameters captures long-term changes in rat behaviour.

Tuning Reinforcement Learning Parameters for Cluster Selection to Enhance Evolutionary Algorithms.

5-HT 2A and 5-HT 2C receptor antagonism differentially modulate reinforcement learning and cognitive flexibility: behavioural and computational evidence

DQN and dynamic feedback for multitask scheduling optimization in engineering management

Harnessing the flexibility of neural networks to predict dynamic theoretical parameters underlying human choice behavior.

A series of unfortunate events: Do those who catastrophize learn more after negative outcomes?

Test-retest reliability of reinforcement learning parameters.

Reliability of gamified reinforcement learning in densely sampled longitudinal assessments.

Graph neural network architecture search for rotating machinery fault diagnosis based on reinforcement learning

A core component of psychological therapy causes adaptive changes in computational learning mechanisms.

Stimulating human prefrontal cortex increases reward learning

Hosting Capacity Assessment Strategies and Reinforcement Learning Methods for Coordinated Voltage Control in Electricity Distribution Networks: A Review

Feedback control approaches for restoration of power grids from blackouts

Reinforcement Learning Disruptions in Individuals With Depression and Sensitivity to Symptom Change Following Cognitive Behavioral Therapy

Impaired Learning From Negative Feedback in Stimulant Use Disorder: Dopaminergic Modulation.

Reinforcement learning for the traveling salesman problem with refueling

Design of model-free reinforcement learning control for tunable vibration absorber system based on magnetorheological elastomer

DEEP REINFORCEMENT LEARNING WITH SPARSE DISTRIBUTED MEMORY FOR “WATER WORLD” PROBLEM SOLVING

Individual differences in experienced and observational decision-making illuminate interactions between reinforcement learning and declarative memory

Autonomic responses to choice outcomes: Links to task performance and reinforcement-learning parameters
