Reinforcement Learning Environment Research Articles

A key challenge for AI is to build embodied systems that operate in dynamically changing environments. Such systems must adapt to changing task contexts and learn continuously. Although standard deep learning systems achieve state of the art results on static benchmarks, they often struggle in dynamic scenarios. In these settings, error signals from multiple contexts can interfere with one another, ultimately leading to a phenomenon known as catastrophic forgetting. In this article we investigate biologically inspired architectures as solutions to these problems. Specifically, we show that the biophysical properties of dendrites and local inhibitory systems enable networks to dynamically restrict and route information in a context-specific manner. Our key contributions are as follows: first, we propose a novel artificial neural network architecture that incorporates active dendrites and sparse representations into the standard deep learning framework. Next, we study the performance of this architecture on two separate benchmarks requiring task-based adaptation: Meta-World, a multi-task reinforcement learning environment where a robotic agent must learn to solve a variety of manipulation tasks simultaneously; and a continual learning benchmark in which the model's prediction task changes throughout training. Analysis on both benchmarks demonstrates the emergence of overlapping but distinct and sparse subnetworks, allowing the system to fluidly learn multiple tasks with minimal forgetting. Our neural implementation marks the first time a single architecture has achieved competitive results in both multi-task and continual learning settings. Our research sheds light on how biological properties of neurons can inform deep learning systems to address dynamic scenarios that are typically impossible for traditional ANNs to solve.

Read full abstract

Legged robots are better able to adapt to different terrains compared with wheeled robots. However, traditional motion controllers suffer from extremely complex dynamics properties. Reinforcement learning (RL) helps to overcome the complications of dynamics design and calculation. In addition, the high autonomy of the RL controller results in a more robust response to complex environments and terrains compared with traditional controllers. However, RL algorithms are limited by the problems of convergence and training efficiency due to the complexity of the task. Learn and outperform the reference motion (LORM), an RL based framework for gait controlling of biped robot is proposed leveraging the prior knowledge of reference motion. The proposed trained agent outperformed the reference motion and existing motion-based methods. The RL environment was finely crafted for optimal performance, including the pruning of state space and action space, reward shaping, and design of episode criterion. Several improvements were implemented to further improve the training efficiency and performance including: random state initialization (RSI), the noise of joint angles, and a novel improvement based on symmetrization of gait. To validate the proposed method, the Darwin-op robot was set as the target platform and two different tasks were designed: (I) Walking as fast as possible and (II) Tracking specific velocity. In task (I), the proposed method resulted in the walking velocity of 0.488 m/s, with a 5.8 times improvement compared with the original traditional reference controller. The directional accuracy improved by 87.3%. The velocity performance achieved 2× compared with the rated max velocity and more than 8× compared with other recent works. To our knowledge, our work achieved the best velocity performance on the platform Darwin-op. In task (II), the proposed method achieved a tracking accuracy of over 95%. Different environments are introduced including plains, slopes, uneven terrains, and walking with external force, where the robot was expected to maintain walking stability with ideal speed and little direction deviation, to validate the performance and robustness of the proposed method.

Read full abstract

Reinforcement Learning Environment Research Articles

Related Topics

Articles published on Reinforcement Learning Environment

A modeling environment for reinforcement learning in games

Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning

Programmatic Reward Design by Example

Stability Verification in Stochastic Control Systems via Neural Network Supermartingales

How to Reduce Action Space for Planning Domains? (Student Abstract)

Same State, Different Task: Continual Reinforcement Learning without Interference

Reinforcement Learning Environment for Advanced Vehicular Ad Hoc Networks Communication Systems.

Optimizing low-Reynolds-number predation via optimal control and reinforcement learning

Improved Exploration in Reinforcement Learning Environments with Low-Discrepancy Action Selection

Compositional Grounded Language for Agent Communication in Reinforcement Learning Environment

Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments.

Sparse Black-Box Video Attack with Reinforcement Learning

Evolution of Agents in the Case of a Balanced Diet

Adaptive Multifactorial Evolutionary Optimization for Multitask Reinforcement Learning

LORM: a novel reinforcement learning framework for biped gait control.

Pursuit and Evasion Strategy of a Differential Game Based on Deep Reinforcement Learning.

Quantifying the effects of environment and population diversity in multi-agent reinforcement learning

Reinforcement Learning Based Relay Selection for Underwater Acoustic Cooperative Networks

Gym-saturation: an OpenAI Gym environment for saturation provers

A multi process value-based reinforcement learning environment framework for adaptive traffic signal control

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Reinforcement Learning Environment Research Articles

Related Topics

Articles published on Reinforcement Learning Environment

A modeling environment for reinforcement learning in games

Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning

Programmatic Reward Design by Example

Stability Verification in Stochastic Control Systems via Neural Network Supermartingales

How to Reduce Action Space for Planning Domains? (Student Abstract)

Same State, Different Task: Continual Reinforcement Learning without Interference

Reinforcement Learning Environment for Advanced Vehicular Ad Hoc Networks Communication Systems.

Optimizing low-Reynolds-number predation via optimal control and reinforcement learning

Improved Exploration in Reinforcement Learning Environments with Low-Discrepancy Action Selection

Compositional Grounded Language for Agent Communication in Reinforcement Learning Environment

Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments.

Sparse Black-Box Video Attack with Reinforcement Learning

Evolution of Agents in the Case of a Balanced Diet

Adaptive Multifactorial Evolutionary Optimization for Multitask Reinforcement Learning

LORM: a novel reinforcement learning framework for biped gait control.

Pursuit and Evasion Strategy of a Differential Game Based on Deep Reinforcement Learning.

Quantifying the effects of environment and population diversity in multi-agent reinforcement learning

Reinforcement Learning Based Relay Selection for Underwater Acoustic Cooperative Networks

Gym-saturation: an OpenAI Gym environment for saturation provers

A multi process value-based reinforcement learning environment framework for adaptive traffic signal control