Owing to radiometric and geometric distortions between images, mismatches are inevitable, so a mismatch removal step is required to improve matching accuracy. Although deep learning methods have been shown to outperform handcrafted methods in specific scenarios, including image identification and point cloud classification, most learning methods are supervised and thus susceptible to labeling errors, and labeling data is time-consuming. This paper takes advantage of deep reinforcement learning (DRL) and proposes a framework named unsupervised learning for mismatch removal (ULMR). Using DRL, ULMR first scores each state–action pair, guided by the output of a classification network; it then calculates the policy gradient of the expected reward; finally, by maximizing the expected reward over state–action pairs, the optimal network is obtained. Compared to supervised learning methods (e.g., NM-Net and LFGC), unsupervised learning methods (e.g., ULCM), and handcrafted methods (e.g., RANSAC and GMS), ULMR achieves higher precision, retains more correct matches, and leaves fewer false matches in testing experiments. Moreover, ULMR shows greater stability, better accuracy, and higher quality in application experiments, and requires fewer sampling iterations while remaining compatible with other classification networks in ablation experiments, indicating its great potential for further use.
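To make the training procedure concrete, below is a minimal REINFORCE-style sketch of the policy-gradient step the abstract describes, written in PyTorch. The network architecture (`MatchClassifier`), the 4-D match encoding, and the `compute_reward` function are illustrative assumptions, not the paper's actual components; in ULMR the reward is guided by the classification network's output, whereas the dummy geometric term here merely lets the sketch run end-to-end.

```python
# Minimal sketch (not the paper's implementation): a classification network
# scores each putative match, keep/discard actions are sampled per match,
# and the expected reward is maximized via the REINFORCE policy gradient.
import torch
import torch.nn as nn

class MatchClassifier(nn.Module):
    """Toy per-match scorer: maps an (x1, y1, x2, y2) correspondence
    to a keep-probability. Architecture is a placeholder."""
    def __init__(self, in_dim: int = 4, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, matches: torch.Tensor) -> torch.Tensor:
        # matches: (N, 4) putative correspondences -> (N,) keep-probabilities
        return torch.sigmoid(self.net(matches)).squeeze(-1)

def compute_reward(matches: torch.Tensor, actions: torch.Tensor) -> torch.Tensor:
    """Placeholder reward. A real reward would score the geometric consistency
    (e.g., epipolar residuals) of the kept subset; here we reward low variance
    of the displacement vectors among kept matches as a dummy stand-in."""
    kept = matches[actions.bool()]
    if kept.shape[0] < 2:               # variance undefined for < 2 samples
        return torch.tensor(0.0)
    disp = kept[:, 2:] - kept[:, :2]    # per-match displacement vectors
    return -disp.var(dim=0).sum()

def reinforce_step(model, optimizer, matches, n_samples: int = 8):
    """One policy-gradient update: sample keep/discard actions, score the
    sampled state-action pairs, and ascend the REINFORCE estimate of the
    expected reward (negated, since the optimizer minimizes)."""
    probs = model(matches)                        # (N,) per-match scores
    dist = torch.distributions.Bernoulli(probs)   # keep/discard policy
    losses = []
    for _ in range(n_samples):
        actions = dist.sample()                   # (N,) binary actions
        reward = compute_reward(matches, actions) # no grad path to model
        log_prob = dist.log_prob(actions).sum()   # log-likelihood of subset
        losses.append(-reward * log_prob)         # negate to maximize reward
    loss = torch.stack(losses).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

if __name__ == "__main__":
    torch.manual_seed(0)
    model = MatchClassifier()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    fake_matches = torch.randn(100, 4)            # stand-in for putative matches
    for step in range(5):
        print(f"step {step}: loss = {reinforce_step(model, opt, fake_matches):.4f}")
```

Averaging the surrogate loss over several sampled action sets per update reduces the variance of the gradient estimate, which is consistent with the abstract's observation that the method can operate with fewer sampling iterations; a learned or running-mean baseline subtracted from the reward would reduce variance further, though it is omitted here for brevity.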