The optimal solution of a non-convex state-dependent LQR problem and its applications.

Xudan Xu,J Jim Zhu,Ping Zhang

doi:10.1371/journal.pone.0094925

Abstract

This paper studies a Non-convex State-dependent Linear Quadratic Regulator (NSLQR) problem, in which the control penalty weighting matrix in the performance index is state-dependent. A necessary and sufficient condition for the optimal solution is established with a rigorous proof by Euler-Lagrange Equation. It is found that the optimal solution of the NSLQR problem can be obtained by solving a Pseudo-Differential-Riccati-Equation (PDRE) simultaneously with the closed-loop system equation. A Comparison Theorem for the PDRE is given to facilitate solution methods for the PDRE. A linear time-variant system is employed as an example in simulation to verify the proposed optimal solution. As a non-trivial application, a goal pursuit process in psychology is modeled as a NSLQR problem and two typical goal pursuit behaviors found in human and animals are reproduced using different control weighting . It is found that these two behaviors save control energy and cause less stress over Conventional Control Behavior typified by the LQR control with a constant control weighting , in situations where only the goal discrepancy at the terminal time is of concern, such as in Marathon races and target hitting missions.

Highlights

Introduction1.1 Problem Definition In this paper, we seek an optimal control law u~k(x,t), for which the performance index ðtf
1.1 Problem Definition In this paper, we seek an optimal control law u~k(x,t), for which the performance index ðtfJ(x(t),u(t))~ L(x(t),u(t),t)dtzw(x(tf ),tf ) t0 1 ðtf ~(xT (t)Q(t)x(t)zuT (t)R(x(t))u(t))dt ð1Þ 2 t0 z xTS(tf )x(tf )is minimized along the associated closed-loop system trajectory of the Linear Time-variant (LTV) system x_ (t)~A(t)x(t)zB(t)u(t) x(t0)~x0ð2Þ where u(t)[Rm is the control input, x(t)[Rn is the system state, t0 is the starting time, tf is the terminal time and x0 is the initial value of x(t) at time t0
The main result of this paper is presented: the necessary and sufficient condition of the optimality of the solution to the Non-convex State-dependent Linear Quadratic Regulator (NSLQR) problem defined in Eq (1) and (2)

Summary

Introduction

1.1 Problem Definition In this paper, we seek an optimal control law u~k(x,t), for which the performance index ðtf. J(x(t),u(t))~ L(x(t),u(t),t)dtzw(x(tf ),tf ) t0 1 ðtf ~. (xT (t)Q(t)x(t)zuT (t)R(x(t))u(t))dt ð1Þ 2 t0 z xT (tf )S(tf )x(tf ). Is minimized along the associated closed-loop system trajectory of the Linear Time-variant (LTV) system x_ (t)~A(t)x(t)zB(t)u(t) x(t0)~x0. Ð2Þ where u(t)[Rm is the control input, x(t)[Rn is the system state, t0 is the starting time, tf is the terminal time and x0 is the initial value of x(t) at time t0. The dependence of variables on t is omitted when no confusion will be introduced in the rest of the paper. It is assumed that A, B, Q are continuous LR in t, R(x) is differentiable with respect to x, and Lx is bounded

Objectives

Findings

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PloS one	Publication Date: Apr 18, 2014
Citations: 23	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The optimal solution of a non-convex state-dependent LQR problem and its applications.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one

Lead the way for us

Similar Papers

Subspace Decomposition for Graphon LQR: Applications to VLSNs of Harmonic Oscillators
Shuang Gao ... Peter E Caines
IEEE Transactions on Control of Network Systems | VOL. 8
Shuang Gao, et. al.Shuang Gao ... Peter E Caines
02 Feb 2021
IEEE Transactions on Control of Network Systems | VOL. 8

A Hamiltonian-based solution to the linear quadratic consensus control problem

-

13 Aug 2012
13 Aug 2012

Robust Policy Iteration for Continuous-Time Linear Quadratic Regulation
Bo Pang ... Tao Bian
IEEE Transactions on Automatic Control | VOL. 67
Bo Pang, et. al.Bo Pang ... Tao Bian
01 Jan 2021
IEEE Transactions on Automatic Control | VOL. 67

Linear quadratic regulation for linear time-varying systems with multiple input delays
Huanshui Zhang ... Lihua Xie
Automatica | VOL. 42
Huanshui Zhang, et. al.Huanshui Zhang ... Lihua Xie
30 Jun 2006
Automatica | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The optimal solution of a non-convex state-dependent LQR problem and its applications.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one