Reinforcement Learning for UAV Attitude Control

William Koch,Renato Mancuso,Azer Bestavros,Richard West

doi:10.1145/3301273

Abstract

Autopilot systems are typically composed of an “inner loop” providing stability and control, whereas an “outer loop” is responsible for mission-level objectives, such as way-point navigation. Autopilot systems for unmanned aerial vehicles are predominately implemented using Proportional-Integral-Derivative (PID) control systems, which have demonstrated exceptional performance in stable environments. However, more sophisticated control is required to operate in unpredictable and harsh environments. Intelligent flight control systems is an active area of research addressing limitations of PID control most recently through the use of reinforcement learning (RL), which has had success in other applications, such as robotics. Yet previous work has focused primarily on using RL at the mission-level controller. In this work, we investigate the performance and accuracy of the inner control loop providing attitude control when using intelligent flight control systems trained with state-of-the-art RL algorithms—Deep Deterministic Policy Gradient, Trust Region Policy Optimization, and Proximal Policy Optimization. To investigate these unknowns, we first developed an open source high-fidelity simulation environment to train a flight controller attitude control of a quadrotor through RL. We then used our environment to compare their performance to that of a PID controller to identify if using RL is appropriate in high-precision, time-critical flight control.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reinforcement Learning for UAV Attitude Control

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Cyber-Physical Systems

Lead the way for us

Journal: ACM Transactions on Cyber-Physical Systems	Publication Date: Feb 13, 2019
Citations: 319

Similar Papers

Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy optimization
Eivind Bohn ... Tor Ame Johansen
-
Eivind Bohn, et. al.Eivind Bohn ... Tor Ame Johansen
01 Jun 2019
01 Jun 2019

Structural and Functional Model of the On-board Expert Control System for a Prospective Unmanned Aerial Vehicle
P I Tutubalin ... V V Mokshin
-
P I Tutubalin, et. al.P I Tutubalin ... V V Mokshin
01 Jan 2020
01 Jan 2020

Authentic Boundary Proximal Policy Optimization.
Yuhu Cheng ... Xuesong Wang
IEEE transactions on cybernetics | VOL. 52
Yuhu Cheng, et. al.Yuhu Cheng ... Xuesong Wang
11 Mar 2021
IEEE transactions on cybernetics | VOL. 52

What is the value of the cross-sectional approach to deep reinforcement learning?
Amine Mohamed Aboussalah ... Chi-Guhn Lee
Quantitative Finance | VOL. ahead-of-print
Amine Mohamed Aboussalah, et. al.Amine Mohamed Aboussalah ... Chi-Guhn Lee
07 Dec 2021
Quantitative Finance | VOL. ahead-of-print

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforcement Learning for UAV Attitude Control

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Cyber-Physical Systems