Dynamic networks have become a pivotal area of study in deep learning because they can selectively activate computing units (such as layers or channels) or dynamically allocate computation to information-rich regions, significantly reducing unnecessary computation by adapting to each input. Despite these advantages, the practical efficiency of dynamic models often falls short of their theoretical computational savings. This discrepancy arises from three primary challenges: 1) the lack of a unified framework across different dynamic inference paradigms, owing to a fragmented research landscape; 2) an excessive focus on algorithm design at the expense of scheduling strategies, which are essential for optimizing resource utilization on hardware; and 3) the difficulty of latency evaluation, since most current libraries cater to static operators. To tackle these issues, we introduce Latency-Aware Unified Dynamic Networks (LAUDNet), a general framework that integrates three fundamental dynamic paradigms (spatially-adaptive computation, layer skipping, and channel skipping) into a single unified formulation. LAUDNet not only refines algorithmic design but also improves scheduling through a latency predictor that efficiently and accurately estimates the inference latency of dynamic operators on specific hardware setups. Empirical assessments across multiple vision tasks (image classification, object detection, and instance segmentation) confirm that LAUDNet significantly narrows the gap between theoretical and practical efficiency. For instance, LAUDNet reduces the practical latency of its static counterpart, ResNet-101, by over 50% on hardware platforms such as V100, RTX 3090, and TX2 GPUs. Moreover, LAUDNet achieves a better accuracy-efficiency trade-off than competing methods.
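To make the unified formulation concrete, below is a minimal PyTorch-style sketch, not the authors' released code: the class, parameter names, and gate design are hypothetical. It illustrates how the three paradigms can reduce to one gating scheme, where a lightweight gate emits a binary mask whose shape determines whether computation is switched off per spatial location, per channel, or per layer.

```python
import torch
import torch.nn as nn

class UnifiedDynamicBlock(nn.Module):
    """Hypothetical sketch: wraps a static residual block `block` with a tiny
    gate that decides, per input, which parts of the block's output to keep.
    The three dynamic paradigms differ only in the shape of the binary mask."""
    def __init__(self, block: nn.Module, channels: int, mode: str = "spatial"):
        super().__init__()
        assert mode in ("spatial", "channel", "layer")
        self.block, self.mode = block, mode
        if mode == "spatial":        # one decision per location -> mask [N,1,H,W]
            self.gate = nn.Conv2d(channels, 1, kernel_size=1)
        elif mode == "channel":      # one decision per channel  -> mask [N,C,1,1]
            self.gate = nn.Linear(channels, channels)
        else:                        # one decision per layer    -> mask [N,1,1,1]
            self.gate = nn.Linear(channels, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.mode == "spatial":
            logits = self.gate(x)                      # gate on the feature map
        else:
            logits = self.gate(x.mean(dim=(2, 3)))     # gate on a pooled descriptor
            logits = logits[:, :, None, None]          # broadcastable over H, W
        mask = (logits > 0).float()                    # hard decision at inference
        return x + mask * self.block(x)                # skipped parts fall back to identity

# Usage: gate a shape-preserving residual block at channel granularity.
body = nn.Sequential(nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(),
                     nn.Conv2d(64, 64, 3, padding=1))
dyn = UnifiedDynamicBlock(body, channels=64, mode="channel")
y = dyn(torch.randn(2, 64, 32, 32))
```

Note that this sketch computes the block's output and zeroes the masked parts; in a real deployment the masked computation would instead be skipped via specialized gather/scatter kernels, and deciding which schedule is actually faster per operator and per device is where the latency predictor comes in.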