Abstract

In this paper, we propose an algorithm and implementation for real-time, optimized kinodynamic motion planning of aerial vehicles with unknown dynamics in crowded environments. A random-sampling, space-filling tree is used both to plan and to rapidly replan a path through the environment. Continuous-time Q-learning is then used to approximately solve the resulting finite-horizon optimal control problem online, so that the planned path is tracked optimally. To facilitate the Q-learning, we propose an actor-critic structure with integral reinforcement learning that approximates the solution of the Hamilton-Jacobi-Bellman equation: the critic approximates the Q-function, while the actor approximates the control policy. We demonstrate our approach on custom drone hardware on which all planning, learning, and control computations are performed onboard in real time.
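To make the actor-critic idea concrete, the sketch below shows one way an integral-reinforcement-learning critic and actor update for a continuous-time Q-function could look. It is a minimal illustration, not the authors' implementation: the quadratic feature basis, the double-integrator stand-in dynamics (used only to generate data, never inside the update, so the learning step stays model-free), the quadratic reward, the learning rates, and the saturation and reset logic are all illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

nx, nu = 2, 1            # state and input dimensions (assumed)
dt, T = 0.01, 0.05       # integration step and reinforcement interval (assumed)

def features(x, u):
    """Quadratic basis phi(x, u), so that Q_hat(x, u) = Wc @ phi(x, u) (an assumption)."""
    z = np.concatenate([x, u])
    return np.concatenate([z, np.outer(z, z)[np.triu_indices(z.size)]])

def reward(x, u):
    """Quadratic tracking-error running reward (illustrative choice)."""
    return -(x @ x + 0.1 * u @ u)

def dynamics(x, u):
    """Double-integrator stand-in for the unknown vehicle dynamics; used only
    to generate data here, never inside the learning update."""
    return np.array([x[1], u[0]])

Wc = rng.normal(scale=0.1, size=features(np.zeros(nx), np.zeros(nu)).size)  # critic weights
Wa = np.array([[-1.0, -1.5]])      # actor weights, initialized to a stabilizing gain (assumed)
alpha_c, alpha_a = 0.5, 0.01       # learning rates (assumed)

x = np.array([1.0, 0.0])
for step in range(2000):
    if step % 200 == 0:                         # periodic reset keeps this toy example bounded
        x = rng.normal(size=nx)
    u = np.clip(Wa @ x + 0.05 * rng.normal(size=nu), -5.0, 5.0)  # exploratory, saturated input

    # Integrate the running reward over one reinforcement interval of length T.
    integral_r, x_next = 0.0, x.copy()
    for _ in range(int(T / dt)):
        integral_r += reward(x_next, u) * dt
        x_next = x_next + dynamics(x_next, u) * dt
    x_next = np.clip(x_next, -10.0, 10.0)       # numerical safeguard for the sketch
    u_next = Wa @ x_next

    # Integral Bellman (temporal-difference) error for the Q-function critic:
    #   e = integral of r over [t, t+T] + Q_hat(x', u') - Q_hat(x, u)
    phi, phi_next = features(x, u), features(x_next, u_next)
    e = integral_r + Wc @ phi_next - Wc @ phi
    d = phi - phi_next
    Wc += alpha_c * e * d / (1.0 + d @ d)       # normalized-gradient critic update

    # Actor step: move the policy parameters along dQ/du (finite-difference estimate).
    dQ_du = (Wc @ features(x, u + 1e-3) - Wc @ phi) / 1e-3
    Wa += alpha_a * dQ_du * x[None, :]
    x = x_next

In this sketch the critic weights Wc are tuned to drive the integral Bellman error to zero, while the actor weights Wa are nudged in the direction that increases the estimated Q-value, mirroring the division of labor described in the abstract.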
