Simulation-based evaluation of model-free reinforcement learning algorithms for quadcopter attitude control and trajectory tracking

Pablo Caffyn Yuste, José Antonio Iglesias Martínez, María Araceli Sanchis De Miguel

doi:10.1016/j.neucom.2024.128362

Abstract

General-use quadcopters have been under development for over a decade, but many of their potential applications are still under evaluation, and they have not yet been adopted in many of the areas that could benefit from them. While the current generation of quadcopters uses a mature set of control algorithms, the next steps, especially as autonomous features are developed, should involve a more complex learning capability that can adapt to unknown circumstances safely and reliably. This paper provides baseline quadcopter control models learnt using eight general reinforcement learning (RL) algorithms in a simulated environment, with the aim of establishing a reference performance, in terms of both precision and generation cost, for a simple set of trajectories. Each algorithm uses a tailored set of hyperparameters, and the influence of random seeds is also studied. While not all algorithms converge within the allocated computing budget, the more complex ones are able to provide stable and precise control models. This paper recommends the use of the TD3 algorithm as a reference for comparison with new RL algorithms. Additional guidance for future work is provided based on the weaknesses identified in the learning process, especially regarding the strong dependence of agent performance on random seeds.
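
As a rough illustration of how the seed dependence mentioned above might be quantified, the following sketch computes a trajectory tracking error for one flight and summarizes that error across several random seeds. It is not taken from the paper: the use of root-mean-square position error as the precision metric, the function names `tracking_rmse` and `summarize_across_seeds`, and the data layout are all assumptions made for the example.

```python
import math
import statistics


def tracking_rmse(reference, flown):
    """Root-mean-square position error between two equal-length 3D trajectories.

    Assumption: precision is measured as RMSE over waypoints; the paper's
    actual metric may differ.
    """
    if len(reference) != len(flown) or not reference:
        raise ValueError("trajectories must be non-empty and of equal length")
    # Squared Euclidean distance between each reference point and flown point.
    squared = [math.dist(r, f) ** 2 for r, f in zip(reference, flown)]
    return math.sqrt(sum(squared) / len(squared))


def summarize_across_seeds(rmse_by_seed):
    """Summarize per-seed errors (hypothetical layout: {seed: rmse})."""
    values = list(rmse_by_seed.values())
    return {
        "mean": statistics.fmean(values),
        # Sample standard deviation needs at least two seeds.
        "std": statistics.stdev(values) if len(values) > 1 else 0.0,
        "best_seed": min(rmse_by_seed, key=rmse_by_seed.get),
        "worst_seed": max(rmse_by_seed, key=rmse_by_seed.get),
    }
```

A large spread between the best and worst seed relative to the mean would be one simple sign of the seed sensitivity the abstract describes.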

Journal: Neurocomputing
Publication Date: Aug 14, 2024
License type: cc-by-nc-nd

Similar Papers

Accelerating autonomous learning by using heuristic selection of actions
Reinaldo A C Bianchi ... Anna H R Costa
Journal of Heuristics | VOL. 14
04 May 2007

Dynamic Economic Optimization of a Continuously Stirred Tank Reactor Using Reinforcement Learning
Derek Machalek ... Titus Quah
01 Jul 2020

PMA-DRL: A parallel model-augmented framework for deep reinforcement learning algorithms
Xufang Luo ... Yunhong Wang
Neurocomputing | VOL. 403
25 Apr 2020

Robust Deep Reinforcement Learning for Security and Safety in Autonomous Vehicle Systems
Aidin Ferdowsi ... Walid Saad
01 Nov 2018
