Abstract

In this paper, the problem of inverse quadratic optimal control over finite time-horizon for discrete-time linear systems is considered. Our goal is to recover the corresponding quadratic objective function using noisy observations. First, the identifiability of the model structure for the inverse optimal control problem is analyzed under relative degree assumption and we show the model structure is strictly globally identifiable. Next, we study the inverse optimal control problem whose initial state distribution and the observation noise distribution are unknown, yet the exact observations on the initial states are available. We formulate the problem as a risk minimization problem and approximate the problem using empirical average. It is further shown that the solution to the approximated problem is statistically consistent under the assumption of relative degrees. We then study the case where the exact observations on the initial states are not available, yet the observation noises are known to be white Gaussian distributed and the distribution of the initial state is also Gaussian (with unknown mean and covariance). EM-algorithm is used to estimate the parameters in the objective function. The effectiveness of our results are demonstrated by numerical examples.

Highlights

  • Since first proposed by [1], there has been a numerous applications of inverse optimal control (IOC) [2,3,4]

  • IOC has been widely developed as a powerful tool to help us understand the optimality criteria underlying biological motions, which could used for control synthesis of

  • Speaking, our goal is to reconstruct the unknown parameters in the quadratic objective function using the given discrete-time linear system dynamics and its noisy measurement of output

Read more

Summary

Introduction

Since first proposed by [1], there has been a numerous applications of inverse optimal control (IOC) [2,3,4]. In [10], given the noisy state observations, the discrete infinite time-horizon case is studied in which the optimal feedback gain K is time-i­nvariant. Their approach is to first identify the feedback matrix K from noisy observations by a maximumlikelihood estimator. In [16], the authors consider the discrete-time IOC for nonlinear systems when some segments of the trajectories and input observations are missing Their analysis for the identifiability does not accord with the other definition of identifiability and statistical consistency cannot be guaranteed. Denotes Kronecker product and [H]i denotes the ith row of matrix H

Problem formulation
Model identifiability
IOC using exact initial values and noisy output
BBT 00
IOC under Gaussian assumption
Numerical examples
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call