Abstract

This letter presents a combination of reinforcement learning (RL) and deterministic controllers to learn a quadrotor control. Learning the quadrotor flight in a standard RL approach requires many iterations of trial and error, which may bring about risky exploration and battery consumption. In this letter, we integrate a classical controller such as PD (proportional and derivative) or LQR (linear quadratic regulator) with a RL policy using their linear combination. The proposed method is not only simple to use, but also fast in learning convergence. When the algorithm is evaluated for a quadrotor trajectory tracking by means of a velocity control for both simulation and experiment, it demonstrates the faster convergence rate and better control performance in comparison with an existing rapid model-based RL method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call