Abstract Recently, it was demonstrated that the design synthesis of truss structures can be modeled as a Markov decision process (MDP) and solved with a tabular reinforcement learning method. In this setting, each state corresponds to a specific design configuration represented as a finite graph. However, when the structural design domain is relatively large, and depending on the constraints, the dimensionality of the state space grows so large that tabular reinforcement learning algorithms become inefficient. Hence, in this study, the design synthesis MDP framework is significantly extended to solve structural design problems with large state spaces by integrating deep reinforcement learning (DRL) into the general MDP framework. This is beneficial because, with DRL, a deep neural network can approximate the state-action value function using far fewer parameters than the cardinality of the state space. This parameterization relies on a problem-relevant set of features and a reward function. Thus, for this extended DRL design synthesis (DRLDS) framework, a compact set of features and a reward function are devised that suit structural design problems in which structural configurations are represented as finite graphs. Across seven structural design synthesis examples, the DRLDS framework is demonstrated to adeptly learn policies that synthesize high-performing, and often the highest-performing, design solutions more frequently than the considered alternative methods, while requiring fewer finite element model evaluations, further demonstrating the effectiveness of the developed set of features and reward function.
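To make the value-function approximation concrete, the sketch below shows how a small neural network can map a fixed-length feature vector, summarizing a graph-encoded design state, to one Q-value per candidate design action. This is a minimal illustration only: the class name `QNetwork`, the layer sizes, and the placeholder feature dimensions are assumptions for exposition and do not reproduce the paper's actual features, reward function, or architecture.

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Small MLP approximating Q(s, a) from a fixed-length feature vector.

    The feature vector is assumed to summarize a graph-encoded truss
    configuration (e.g., member counts, connectivity measures, constraint
    margins); the paper's actual feature set is not reproduced here.
    """

    def __init__(self, feature_dim: int, num_actions: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feature_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, hidden),
            nn.ReLU(),
            nn.Linear(hidden, num_actions),  # one Q-value per candidate action
        )

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        return self.net(features)

# Usage: greedy action selection for a single state (batch of one).
q_net = QNetwork(feature_dim=16, num_actions=8)
state_features = torch.randn(1, 16)  # placeholder graph-state features
best_action = q_net(state_features).argmax(dim=1)
```

The key point is that the network's parameter count depends only on the feature and action dimensions, not on the number of distinct design configurations, which is what allows the approach to scale where tabular methods cannot.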