Deep reinforcement learning for quadrotor path following with adaptive velocity

Bartomeu Rubí,Bernardo Morcego,Ramon Pérez

doi:10.1007/s10514-020-09951-8

Bartomeu Rubí, Bernardo Morcego + Show 1 more

Open Access

https://doi.org/10.1007/s10514-020-09951-8

Copy DOI

Journal: Autonomous robots	Publication Date: Oct 24, 2020
Citations: 24	License type: other-oa

Affiliation: Universitat Politècnica de Catalunya

Abstract

This paper proposes a solution for the path following problem of a quadrotor vehicle based on deep reinforcement learning theory. Three different approaches implementing the Deep Deterministic Policy Gradient algorithm are presented. Each approach emerges as an improved version of the preceding one. The first approach uses only instantaneous information of the path for solving the problem. The second approach includes a structure that allows the agent to anticipate to the curves. The third agent is capable to compute the optimal velocity according to the path’s shape. A training framework that combines the tensorflow-python environment with Gazebo-ROS using the RotorS simulator is built. The three agents are tested in RotorS and experimentally with the Asctec Hummingbird quadrotor. Experimental results prove the validity of the agents, which are able to achieve a generalized solution for the path following problem.

Full Text