Abstract
This work explores the application of reinforcement learning to control the descend/landing phase of a reusable rocket model, where the rocket must track a nominal/planar reference path with adequate attitude. To tackle the control problem, which is multivariable, a cascade control architecture is proposed, consisting of three single-input single-output controllers. The inner control loop is responsible for controlling the attitude using a reference for attitude provided by the outer loop and the thrust vectoring, where the gimbal angle is the manipulated variable. The position error is handled by the external loop, that handles the altitude error, used to adjust thrust. Several configurations, using Linear Quadratic Gaussian (LQG) control (LQ regulator with a Kalman observer), PID control and adaptive control based on Reinforcement Learning (optimal control with learning capability based on a linear regression model), are evaluated and compared.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have