Abstract
A learning-based online optimal sliding-mode control strategy is developed for space circumnavigation missions subject to input constraints. The infinite-horizon optimal control problem also accounts for mismatched uncertainties caused by measurement uncertainties. A logarithmic hyperbolic cosine function is used to design the optimal value function, which overcomes the weakness that the derivative of the adaptive neural network (NN) weights changes too fast when the value of the sliding-mode function is large. A second, suitably chosen nonquadratic function incorporates the input constraints into the optimal control framework. To approximate the solution of the Hamilton-Jacobi-Bellman equation corresponding to the novel optimal value function, an actor-critic (AC) architecture based on NNs is introduced, and a finite-time disturbance observer (FTDO) is employed to estimate the mismatched uncertainties in the plant. Simulation results verify the effectiveness of the proposed approach.
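As a rough illustration of why a logarithmic hyperbolic cosine term can temper fast weight updates, the sketch below compares the gradient of ln(cosh(s)) with that of a quadratic term 0.5·s² for a scalar sliding variable s. It is not the value function or update law of the paper: the function names are hypothetical, and it assumes only that an update driven by the cost gradient grows with that gradient. The one fact it relies on is that d/ds ln(cosh(s)) = tanh(s), which is bounded by 1, whereas the quadratic gradient grows linearly in s.

```python
import math


def log_cosh(s: float) -> float:
    """Numerically stable ln(cosh(s)); avoids overflow of cosh for large |s|."""
    a = abs(s)
    # ln((e^a + e^-a) / 2) = a + ln(1 + e^(-2a)) - ln(2)
    return a + math.log1p(math.exp(-2.0 * a)) - math.log(2.0)


def log_cosh_grad(s: float) -> float:
    """Gradient of ln(cosh(s)) with respect to s; always within [-1, 1]."""
    return math.tanh(s)


def quadratic_grad(s: float) -> float:
    """Gradient of the quadratic term 0.5 * s**2; unbounded in s."""
    return s


# Illustrative comparison over a few sliding-variable magnitudes (assumed values).
for s in (0.1, 1.0, 10.0, 100.0):
    print(f"s={s:7.1f}  log-cosh grad={log_cosh_grad(s):.4f}  quadratic grad={quadratic_grad(s):.1f}")
```

For small s the two gradients nearly coincide, so behavior near the sliding surface is similar; for large s the log-cosh gradient saturates, which is the property the abstract appeals to.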