Abstract

This article studies the optimal control policy learning for underactuated vertical take-off and landing (VTOL) aerial vehicles subject to the unknown mass and inertia matrix. A novel off-policy integral reinforcement learning (IRL) scheme is presented for simultaneously unknown parameter identification and optimal trajectory tracking. In the outer loop of the VTOL vehicles, a novel off-policy IRL scheme is proposed, where the fixed control policy for data generation is chosen to be different from the iterated control policy and the feedforward term with an unknown mass can be learned along with the optimal control policy. In the inner loop, a hybrid off-policy IRL algorithm is developed to tackle the optimal attitude control policy learning and inertia matrix identification under the hybrid control scheme introduced by the employed inner–outer loop control strategy. A simulation study is finally provided to demonstrate the effectiveness of the proposed algorithm.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call