Deep Reinforcement Learning Policies Research Articles

An efficient active structural control scheme is introduced that combines data-driven and physics-inspired reinforcement learning (RL) approaches. The controller is designed to deliver optimal feedback forces based on full-state information; its training uses deep neural networks (NNs) and a specially designed gradient descent-based sequence within the RL framework. This integration of algorithms yields a unique active control policy that accelerates learning and therefore demands considerably fewer computational resources to develop an optimal and stable controller than existing data-driven approaches. The data-driven and physics-inspired components are, respectively, the deep deterministic policy gradient (DDPG) algorithm and an iterative gradient-based state feedback control (SFSC) algorithm, which together form the architecture of the hybrid control policy, referred to as the hybrid RL-controller. An additional advantage of the hybrid RL-controller is its ability to operate in continuous state–action spaces, which allows it to address diverse structural control problems. The staggered performance of the hybrid RL-controller, in both continuous and discrete time, is evaluated in three case studies involving structures in linear and nonlinear regimes. The outcomes include a detailed comparison of the hybrid RL-controller with the individually designed DDPG and SFSC RL control strategies, as well as with the uncontrolled scenario. Furthermore, the study investigates real-life implementation concerns for control strategies, such as perturbations in model parameters and input forces, and time delays in the feedback loop. Finally, the results corroborate that the designed RL-controller shows superior performance and faster execution time in the feedback loop, making it suitable for the vibration control of multidimensional complex structures.
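The idea of delivering feedback forces from full-state information can be illustrated with a minimal sketch. It assumes a single-degree-of-freedom mass-spring-damper with hypothetical parameters and a hand-picked linear gain `K`; it does not reproduce the DDPG or SFSC training described in the abstract, and only compares a state-feedback force u = -Kx against the uncontrolled scenario.

```python
import numpy as np

def simulate(K, x0=(0.05, 0.0), m=1.0, c=0.4, k=100.0, dt=1e-3, steps=5000):
    """Free vibration of a 1-DOF oscillator under state feedback u = -K x.

    All parameter values are illustrative assumptions, not taken from the article.
    State x = [displacement, velocity]; integration is forward Euler.
    """
    A = np.array([[0.0, 1.0], [-k / m, -c / m]])   # open-loop dynamics
    B = np.array([[0.0], [1.0 / m]])               # control force enters the velocity equation
    K = np.asarray(K, dtype=float).reshape(1, 2)   # feedback gain (hand-picked, not learned)
    A_closed = A - B @ K                           # closed-loop dynamics
    x = np.array(x0, dtype=float)
    history = np.empty((steps, 2))
    for i in range(steps):
        history[i] = x
        x = x + dt * (A_closed @ x)
    return history

def rms_displacement(history):
    """Root-mean-square displacement over the simulated window."""
    return float(np.sqrt(np.mean(history[:, 0] ** 2)))

uncontrolled = simulate(K=[0.0, 0.0])   # no feedback force
controlled = simulate(K=[0.0, 8.0])     # velocity feedback adds damping

print("RMS displacement, uncontrolled:", rms_displacement(uncontrolled))
print("RMS displacement, controlled:  ", rms_displacement(controlled))
```

In a learned controller the fixed gain would be replaced by the output of the trained policy; the fixed `K` here only makes the comparison with the uncontrolled case concrete.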


Preparing to close the critical gap in a future pandemic between non-pharmacological measures and the deployment of new drugs/vaccines requires addressing two factors: 1) finding virus/pathogen-agnostic pathophysiological targets to mitigate disease severity and 2) finding a more rational approach to repurposing existing drugs. It is increasingly recognized that acute viral disease severity is heavily driven by the immune response to the infection ("cytokine storm" or "cytokine release syndrome"). Numerous clinically available biologics suppress various pro-inflammatory cytokines/mediators, but it is extremely difficult to identify clinically effective treatment regimens with these agents. We propose that this is a complex control problem that resists standard methods of developing treatment regimens, and that accomplishing this goal requires the application of simulation-based, model-free deep reinforcement learning (DRL) in a fashion akin to training successful game-playing artificial intelligences (AIs). This proof-of-concept study determines whether simulated sepsis (e.g. infection-driven cytokine storm) can be controlled in the absence of effective antimicrobial agents by targeting cytokines for which FDA-approved biologics currently exist. We use a previously validated agent-based model, the Innate Immune Response Agent-based Model (IIRABM), for control discovery using DRL. DRL training used a Deep Deterministic Policy Gradient (DDPG) approach with a clinically plausible control interval of 6 hours and manipulation of six cytokines for which there are existing drugs: Tumor Necrosis Factor (TNF), Interleukin-1 (IL-1), Interleukin-4 (IL-4), Interleukin-8 (IL-8), Interleukin-12 (IL-12) and Interferon-γ (IFNg). DRL trained an AI policy that improved outcomes from a baseline Recovered Rate of 61% to a Recovered Rate of 90% over ~21 days of simulated time. This DRL policy was then tested on four different parameterizations not seen in training, representing a range of host and microbe characteristics, and improved the Recovered Rate by +33% to +56%. The current proof-of-concept study demonstrates that significant disease severity mitigation can potentially be accomplished with existing anti-mediator drugs, but only through a multi-modal, adaptive treatment policy requiring implementation with an AI. While actual clinical implementation of this approach is a projection for the future, the current goal of this work is to inspire the development of a research ecosystem that marries what is needed to improve the simulation models with the development of the sensing/assay technologies to collect the data needed to iteratively refine those models.
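As a rough sketch of the control setup described above, the snippet below counts how many 6-hour decisions fit into a 21-day episode and builds one continuous six-component action, as a DDPG-style policy would emit at each decision. The action bounds of [-1, 1], the Gaussian exploration noise, and all function names are assumptions made for illustration; the abstract does not specify them, and the zero policy output stands in for a trained actor network.

```python
import numpy as np

# The six cytokines named in the abstract.
CYTOKINES = ("TNF", "IL-1", "IL-4", "IL-8", "IL-12", "IFNg")
CONTROL_INTERVAL_HOURS = 6
HORIZON_DAYS = 21  # the abstract says "~21 days", so the count below is approximate

def decisions_per_episode(days=HORIZON_DAYS, interval_hours=CONTROL_INTERVAL_HOURS):
    """Number of control decisions in one simulated episode."""
    return (days * 24) // interval_hours

def noisy_action(policy_output, rng, noise_std=0.1, low=-1.0, high=1.0):
    """Add exploration noise to a deterministic policy output and clip to bounds.

    Bounds and noise model are hypothetical; they are not taken from the study.
    """
    policy_output = np.asarray(policy_output, dtype=float)
    noise = rng.normal(0.0, noise_std, size=policy_output.shape)
    return np.clip(policy_output + noise, low, high)

rng = np.random.default_rng(0)
placeholder_policy_output = np.zeros(len(CYTOKINES))  # stand-in for a trained actor
action = noisy_action(placeholder_policy_output, rng)
schedule = dict(zip(CYTOKINES, action))  # one value per cytokine for this decision

print("Decisions per episode:", decisions_per_episode())
print("Action for one decision:", schedule)
```

With these assumptions an episode contains 84 decisions, each a six-dimensional continuous action, which gives a sense of why hand-designing such a treatment regimen is difficult.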



Articles published on Deep Reinforcement Learning Policies

Multi-agent long-distance end-to-end indoor navigation: using imitation learning pre-training and global map

A versatile door opening system with mobile manipulator through adaptive position-force control and reinforcement learning

Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control

Continuous control of structural vibrations using hybrid deep reinforcement learning policy

Deep reinforcement learning sensor scheduling for effective monitoring of dynamical systems

Unleashing mixed-reality capability in Deep Reinforcement Learning-based robot motion generation towards safe human–robot collaboration

π-Light: Programmatic Interpretable Reinforcement Learning for Resource-Limited Traffic Signal Control

Cooperative Deep Reinforcement Learning Policies for Autonomous Navigation in Complex Environments

Evolving interpretable decision trees for reinforcement learning

Multi-agent reinforcement learning satellite guidance for triangulation of a moving object in a relative orbit frame

Pretty Darn Good Control: When are Approximate Solutions Better than Approximate Models.

Spatio-Temporal Graph Convolutional Neural Networks for Physics-Aware Grid Learning Algorithms

Deep reinforcement learning and 3D physical environments applied to crowd evacuation in congested scenarios

Industrial Cross-Robot Transfer Learning

Preparing for the next pandemic: Simulation-based deep reinforcement learning to discover and test multimodal control of systemic inflammation using repurposed immunomodulatory agents.

Sim–Real Mapping of an Image-Based Robot Arm Controller Using Deep Reinforcement Learning

A DRL-Driven Intelligent Optimization Strategy for Resource Allocation in Cloud-Edge-End Cooperation Environments

RSAC: A Robust Deep Reinforcement Learning Strategy for Dimensionality Perturbation

Deep-Reinforcement-Learning-Based Resource Allocation for Content Distribution in Fog Radio Access Networks

Frame-Correlation Transfers Trigger Economical Attacks on Deep Reinforcement Learning Policies.

