Online Learning of the Kalman Filter With Logarithmic Regret

Anastasios Tsiamis,George J Pappas

doi:10.1109/tac.2022.3207670

Anastasios Tsiamis, George J Pappas

Open Access

https://doi.org/10.1109/tac.2022.3207670

Copy DOI

Abstract

In this paper, we consider the problem of predicting observations generated online by an unknown, partially observable linear system, which is driven by Gaussian noise. In the linear Gaussian setting, the optimal predictor in the mean square error sense is the celebrated Kalman filter, which can be explicitly computed when the system model is known. When the system model is unknown, we have to learn how to predict observations online based on finite data, suffering possibly a non-zero regret with respect to the Kalman filter's prediction. We show that it is possible to achieve a regret of the order of <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$\text{poly}\log (\mathsf {N})$</tex-math></inline-formula> with high probability, where <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$N$</tex-math></inline-formula> is the number of observations collected. This is achieved using an online least-squares algorithm, which exploits the approximately linear relation between future observations and past observations. The regret analysis is based on the stability properties of the Kalman filter, recent statistical tools for finite sample analysis of system identification, and classical results for the analysis of least-squares algorithms for time series. Our regret analysis can also be applied to other predictors, e.g. multiple step-ahead prediction, or prediction under exogenous inputs including closed-loop prediction. A fundamental technical contribution is that our bounds hold even for the class of non-explosive systems (including marginally stable systems), which was not addressed before in the case of online prediction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Online Learning of the Kalman Filter With Logarithmic Regret

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automatic Control

Lead the way for us

Journal: IEEE Transactions on Automatic Control	Publication Date: May 1, 2023
Citations: 15

Similar Papers

Initial alignment of Inertial Navigation System based on a predictive iterated Kalman filter
Guanghao Cheng ... Lei Guo
-
Guanghao Cheng, et. al.Guanghao Cheng ... Lei Guo
01 Jul 2018
01 Jul 2018

Enhancing seamless INS/UWB integrated localization using LS-SVM assisted predictive adaptive Kalman filter
Yuan Xu ... Ning Feng
-
Yuan Xu, et. al.Yuan Xu ... Ning Feng
01 Jul 2019
01 Jul 2019

Normalized unscented Kalman filter and normalized unscented RTS smoother for nonlinear state-space model identification
Masaya Murata ... Kunio Kashino
-
Masaya Murata, et. al.Masaya Murata ... Kunio Kashino
01 Jun 2013
01 Jun 2013

Star-sensor-based predictive Kalman filter for satellite attitude estimation
Yurong Lin ... Zhenglong Deng
Science in China Series F Information Sciences | VOL. 45
Yurong Lin, et. al.Yurong Lin ... Zhenglong Deng
01 Jun 2002
Science in China Series F Information Sciences | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online Learning of the Kalman Filter With Logarithmic Regret

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automatic Control