Low-level Policies Research Articles

Autonomous control in high-dimensional, continuous state spaces is a persistent and important challenge in robotics and artificial intelligence. Because of the high risk and complexity involved, adopting AI for autonomous combat systems has been a long-standing difficulty. To address these issues, DARPA's AlphaDogfight Trials (ADT) program sought to vet the feasibility of, and increase trust in, AI for autonomously piloting an F-16 in simulated air-to-air combat. Our submission to ADT solves the high-dimensional, continuous control problem using a novel hierarchical deep reinforcement learning approach consisting of a high-level policy selector and a set of separately trained low-level policies, each specialized for a specific region of the state space. Both levels of the hierarchy are trained using off-policy, maximum entropy methods, with expert knowledge integrated through reward shaping. Our approach outperformed human expert pilots and achieved a second-place rank in the ADT championship event.

Impact Statement: Significant performance milestones in reinforcement learning have been achieved in recent years, with autonomous agents demonstrating super-human performance across a wide variety of tasks. Before these algorithms can be extensively deployed in real-world defense applications, a greater level of trust must first be achieved. ADT was an important step towards developing the trust necessary to operationalize these algorithms, by demonstrating their effectiveness on a foundational yet relevant problem in a high-fidelity simulation environment. Developed for the program, our hierarchical reinforcement learning agent was designed alongside, and competed against, active fighter pilots, and ultimately defeated a graduate of the United States Air Force's F-16 Weapons Instructor Course in match play.
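The two-level structure described in the abstract above (a high-level selector that hands control to one of several specialized low-level policies) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the state features, the two policy names, the control outputs, and the fixed range threshold are hypothetical, and the hand-written rule stands in for the learned selector. It is not the authors' implementation and involves no training.

```python
from typing import Callable, Dict, Tuple

# Hypothetical state features: (range_to_target, bearing_error).
State = Tuple[float, float]
# Hypothetical controls: (turn_command in [-1, 1], throttle in [0, 1]).
Action = Tuple[float, float]


def clamp(x: float, lo: float = -1.0, hi: float = 1.0) -> float:
    """Limit x to the interval [lo, hi]."""
    return max(lo, min(hi, x))


def close_in(state: State) -> Action:
    """Hypothetical low-level policy: turn toward the target at full throttle."""
    _, bearing_error = state
    return (clamp(bearing_error), 1.0)


def hold_position(state: State) -> Action:
    """Hypothetical low-level policy: turn toward the target at reduced throttle."""
    _, bearing_error = state
    return (clamp(bearing_error), 0.3)


# Registry of separately defined low-level policies, keyed by name.
LOW_LEVEL: Dict[str, Callable[[State], Action]] = {
    "close_in": close_in,
    "hold_position": hold_position,
}


def select_policy(state: State) -> str:
    """Stand-in for a learned high-level selector: a fixed range threshold."""
    range_to_target, _ = state
    return "close_in" if range_to_target > 1000.0 else "hold_position"


def act(state: State) -> Tuple[str, Action]:
    """Pick a low-level policy for this state, then query it for an action."""
    name = select_policy(state)
    return name, LOW_LEVEL[name](state)


if __name__ == "__main__":
    print(act((5000.0, 0.2)))
    print(act((500.0, -3.0)))
```

In a learned hierarchy, `select_policy` would itself be a trained policy and each entry in `LOW_LEVEL` a separately trained network; the dispatch pattern in `act` is the part this sketch is meant to show.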

Articles published on Low-level Policies

Energy management for object tracking under the energy-harvesting: Hierarchical reinforcement learning method

Hierarchical Knowledge-Enhancement Framework for multi-hop knowledge graph reasoning

Data-Driven Self-Triggered Control for Networked Motor Control Systems Using RNNs and Pre-Training: A Hierarchical Reinforcement Learning Framework.

Hierarchical Adversarial Inverse Reinforcement Learning.

Boosting Reinforcement Learning via Hierarchical Game Playing With State Relay.

Spatial memory-augmented visual navigation based on hierarchical deep reinforcement learning in unknown environments

Hierarchical Reinforcement Learning for Air Combat at DARPA's AlphaDogfight Trials

Integrated train timetabling and rolling stock rescheduling for a disturbed metro system: A hybrid deep reinforcement learning and adaptive large neighborhood search approach

H-TSP: Hierarchically Solving the Large-Scale Traveling Salesman Problem

State-Conditioned Adversarial Subgoal Generation

Predictive hierarchical reinforcement learning for path-efficient mapless navigation with moving target

Integrated rescheduling of train timetables and rolling stock circulation for metro line disturbance management: a Q-learning-based approach

Hierarchical Vision Navigation System for Quadruped Robots with Foothold Adaptation Learning.

A Hierarchical Compliance-Based Contextual Policy Search for Robotic Manipulation Tasks With Multiple Objectives

Adjacency Constraint for Efficient Hierarchical Reinforcement Learning.

Admission-Based Reinforcement-Learning Algorithm in Sequential Social Dilemmas

Learning Task-Agnostic Action Spaces for Movement Optimization.

A Multiagent Cooperative Decision-Making Method for Adaptive Intersection Complexity Based on Hierarchical RL

Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning

Hierarchical reinforcement learning for automatic disease diagnosis.
