Abstract
Expectation maximization (EM) is a technique for estimating the maximum-likelihood parameters of a latent variable model given observed data by alternating between taking expectations of sufficient statistics and maximizing the expected log likelihood. For situations where the sufficient statistics are intractable, stochastic approximation EM (SAEM) is often used, which approximates the expected log likelihood with Monte Carlo techniques. Two common implementations of SAEM, Batch EM (BEM) and online EM (OEM), are parameterized by a “learning rate”, and their efficiency depends strongly on this parameter. We propose an extension to the OEM algorithm, termed Introspective Online Expectation Maximization (IOEM), which removes the need to specify this parameter by adapting the learning rate to trends in the parameter updates. We show that IOEM matches the efficiency of the optimal BEM and OEM algorithms in multiple models, and that its efficiency can exceed that of BEM/OEM with optimal learning rates when the model has many parameters. Finally, we use IOEM to fit two models to a financial time series. A Python implementation is available at https://github.com/luntergroup/IOEM.git.
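To make the role of the learning rate concrete, the following is a minimal sketch of a generic online EM update of the kind the abstract refers to, applied to a toy two-component Gaussian mixture with known weights and variances. The mixture model, the schedule gamma_n = (n + 1)^(-alpha), and all names used here are illustrative assumptions, not the paper's models or the IOEM adaptation rule.

```python
import numpy as np

def responsibilities(y, mu, sigma=1.0, weights=(0.5, 0.5)):
    """E-step for one observation: posterior probability of each component
    (normalizing constants cancel because both components share sigma)."""
    dens = np.array(weights) * np.exp(-0.5 * ((y - np.asarray(mu)) / sigma) ** 2)
    return dens / dens.sum()

def online_em_means(data, alpha=0.7, mu0=(-1.0, 1.0)):
    """Online EM for the component means, with learning rate gamma_n = (n+1)**(-alpha).
    alpha is the tuning parameter whose choice OEM-style methods depend on:
    a small alpha keeps the estimates noisy, a large alpha makes them adapt slowly."""
    mu = np.array(mu0, dtype=float)
    s = np.full(2, 0.5)      # running average of responsibilities
    t = s * mu               # running average of responsibility-weighted observations
    for n, y in enumerate(data, start=1):
        gamma = (n + 1) ** (-alpha)              # learning rate
        r = responsibilities(y, mu)
        s = (1.0 - gamma) * s + gamma * r        # stochastic approximation of E[r]
        t = (1.0 - gamma) * t + gamma * r * y    # stochastic approximation of E[r * y]
        mu = t / s                               # M-step: means from averaged statistics
    return mu

rng = np.random.default_rng(0)
obs = np.where(rng.random(5000) < 0.5, rng.normal(-2, 1, 5000), rng.normal(2, 1, 5000))
print(online_em_means(obs))   # estimates should approach the true means -2 and 2 (up to label order)
```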
Highlights
Expectation Maximization (EM) is a general and widely used technique for estimating maximum likelihood parameters of latent variable models (Dempster et al 1977)
We propose an extension to the online EM (OEM) algorithm, termed Introspective Online Expectation Maximization (IOEM), which removes the need to specify a learning rate by adapting it to trends in the parameter updates
We show that our algorithm matches the efficiency of the optimal Batch EM (BEM) and OEM algorithms in multiple models, and that the efficiency of IOEM can exceed that of BEM/OEM methods with optimal learning rates when the model has many parameters
Summary
Expectation Maximization (EM) is a general and widely used technique for estimating maximum likelihood parameters of latent variable models (Dempster et al 1977). Le Corff and Fort (2013) introduced a “block online” EM algorithm for hidden Markov models that combines online and batch ideas, controlling convergence through a block size sequence τ_k. All these algorithms require choosing tuning parameters in the form of a batch size, block sequence, learning rate or learning schedule. In the context of (stochastic) gradient descent optimization (Bottou 2012), several influential adaptive algorithms have recently been proposed (Zeiler 2012; Kingma and Ba 2015; Mandt et al 2016; Reddi et al 2018) that have few or no tuning parameters. In principle, these methods can be used to find maximum likelihood parameters, but unless data are processed in batches, applying them to state-space models with a sequential structure is not straightforward.
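As a concrete illustration of one such tuning parameter, the sketch below shows a “block online” update schedule in which the M-step is performed only at block boundaries and the growing block sizes τ_k are the quantity to be chosen. This is only a schematic of the schedule, loosely following the idea in Le Corff and Fort (2013); the actual block online EM algorithm additionally runs forward smoothing recursions within each block, which are not shown, and the polynomial growth exponent used here is an arbitrary assumption.

```python
import math

def block_boundaries(num_blocks, a=1.5):
    """Observation indices at which the M-step runs when block k has size
    tau_k = ceil(k**a); the exponent a plays a role analogous to the
    learning-rate exponent in online EM (an illustrative choice)."""
    boundaries, total = [], 0
    for k in range(1, num_blocks + 1):
        total += math.ceil(k ** a)   # tau_k: block sizes grow so estimates stabilize
        boundaries.append(total)
    return boundaries

print(block_boundaries(8))   # [1, 4, 10, 18, 30, 45, 64, 87]
```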