Abstract

This paper studies a Non-convex State-dependent Linear Quadratic Regulator (NSLQR) problem, in which the control penalty weighting matrix in the performance index is state-dependent. A necessary and sufficient condition for the optimal solution is established with a rigorous proof by Euler-Lagrange Equation. It is found that the optimal solution of the NSLQR problem can be obtained by solving a Pseudo-Differential-Riccati-Equation (PDRE) simultaneously with the closed-loop system equation. A Comparison Theorem for the PDRE is given to facilitate solution methods for the PDRE. A linear time-variant system is employed as an example in simulation to verify the proposed optimal solution. As a non-trivial application, a goal pursuit process in psychology is modeled as a NSLQR problem and two typical goal pursuit behaviors found in human and animals are reproduced using different control weighting . It is found that these two behaviors save control energy and cause less stress over Conventional Control Behavior typified by the LQR control with a constant control weighting , in situations where only the goal discrepancy at the terminal time is of concern, such as in Marathon races and target hitting missions.

Highlights

  • Introduction1.1 Problem Definition In this paper, we seek an optimal control law u~k(x,t), for which the performance index ðtf

  • 1.1 Problem Definition In this paper, we seek an optimal control law u~k(x,t), for which the performance index ðtfJ(x(t),u(t))~ L(x(t),u(t),t)dtzw(x(tf ),tf ) t0 1 ðtf ~(xT (t)Q(t)x(t)zuT (t)R(x(t))u(t))dt ð1Þ 2 t0 z xTS(tf )x(tf )is minimized along the associated closed-loop system trajectory of the Linear Time-variant (LTV) system x_ (t)~A(t)x(t)zB(t)u(t) x(t0)~x0ð2Þ where u(t)[Rm is the control input, x(t)[Rn is the system state, t0 is the starting time, tf is the terminal time and x0 is the initial value of x(t) at time t0

  • The main result of this paper is presented: the necessary and sufficient condition of the optimality of the solution to the Non-convex State-dependent Linear Quadratic Regulator (NSLQR) problem defined in Eq (1) and (2)

Read more

Summary

Introduction

1.1 Problem Definition In this paper, we seek an optimal control law u~k(x,t), for which the performance index ðtf. J(x(t),u(t))~ L(x(t),u(t),t)dtzw(x(tf ),tf ) t0 1 ðtf ~. (xT (t)Q(t)x(t)zuT (t)R(x(t))u(t))dt ð1Þ 2 t0 z xT (tf )S(tf )x(tf ). Is minimized along the associated closed-loop system trajectory of the Linear Time-variant (LTV) system x_ (t)~A(t)x(t)zB(t)u(t) x(t0)~x0. Ð2Þ where u(t)[Rm is the control input, x(t)[Rn is the system state, t0 is the starting time, tf is the terminal time and x0 is the initial value of x(t) at time t0. The dependence of variables on t is omitted when no confusion will be introduced in the rest of the paper. It is assumed that A, B, Q are continuous LR in t, R(x) is differentiable with respect to x, and Lx is bounded

Objectives
Findings
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.