Abstract

Functional time series analysis, whether based on time of frequency domain methodology, has traditionally been carried out under the assumption of complete observation of the constituent series of curves, assumed stationary. Nevertheless, as is often the case with independent functional data, it may well happen that the data available to the analyst are not the actual sequence of curves, but relatively few and noisy measurements per curve, potentially at different locations in each curve's domain. Under this sparse sampling regime, neither the established estimators of the time series' dynamics, nor their corresponding theoretical analysis will apply. The subject of this paper is to tackle the problem of estimating the dynamics and of recovering the latent process of smooth curves in the sparse regime. Assuming smoothness of the latent curves, we construct a consistent nonparametric estimator of the series' spectral density operator and use it develop a frequency-domain recovery approach, that predicts the latent curve at a given time by borrowing strength from the (estimated) dynamic correlations in the series across time. Further to predicting the latent curves from their noisy point samples, the method fills in gaps in the sequence (curves nowhere sampled), denoises the data, and serves as a basis for forecasting. Means of providing corresponding confidence bands are also investigated. A simulation study interestingly suggests that sparse observation for a longer time period, may be provide better performance than dense observation for a shorter period, in the presence of smoothness. The methodology is further illustrated by application to an environmental data set on fair-weather atmospheric electricity, which naturally leads to a sparse functional time-series.

Highlights

  • Functional data analysis constitutes a collection of statistical methods to analyse data comprised of ensembles of random functions: multiple occurrences of random processes evolving continuously in time and/or space, typically over a bounded rectangular domain [33, 11, 19, 41]

  • Panaretos and Tavakoli [28] introduced the notation of spectral density operators and harmonic principal components, capturing the complete secondorder dynamics in the frequency domain, whereas Panaretos and Tavakoli [29] showed how to estimate the said spectral density operators by smoothing the operator-valued analogue of the periodogram

  • In this article we address this gap and consider the problem of estimating the complete dynamics, and recovering the latent curves, in a stationary functional time series that is observed sparsely, irregularly, and with measurement errors

Read more

Summary

Introduction

Functional data analysis constitutes a collection of statistical methods to analyse data comprised of ensembles of random functions: multiple occurrences of random processes evolving continuously in time and/or space, typically over a bounded rectangular domain [33, 11, 19, 41]. A step further from the estimation of isolated characteristics such as the mean function and the said long-run covariance operator is to estimate the entire second-order structure of the process, without assuming linearity To this aim, Panaretos and Tavakoli [28] introduced the notation of spectral density operators and harmonic principal components, capturing the complete secondorder dynamics in the frequency domain, whereas Panaretos and Tavakoli [29] showed how to estimate the said spectral density operators by smoothing the operator-valued analogue of the periodogram. Our methodology can be interpreted in a design context: in certain applications, it might be possible for the scientist to choose how to distribute a given fixed budget of measurements over individual curves and over time In this case, one might ask how to better estimate the underlying dynamics: whether it is better to sample a functional time series more densely over shorter time-span, or to record fewer observations per curve but over a longer time-space. In Appendix A we comment on some implementation concerns, the formal proofs are included in Appendix B, and some additional results of the numerical experiments are presented in Appendix C

Functional time series framework
Observation scheme
Nonparametric estimation of the model dynamics
Spectral density kernel estimation
Periodic behaviour identification
Functional data recovery framework and confidence bands
On the choice of mixing conditions
Asymptotic results under cumulant mixing conditions
Asymptotic results under strong mixing conditions
Functional data recovery and confidence bands
Simulation setting
Estimation of the spectral density
Recovery of functional data from sparse observations
Data analysis: fair-weather athmospheric electricity
Selection of Bartlett’s span parameter L
Representation of functional data
Forecasting
Proof of Theorem 1
Proof of Theorem 2
Proof of Theorem 3
Proof of Theorem 4
Proof of Theorem 5
Proof of Theorem 6
Proof of Proposition 1 and Proposition 2
Determination of the optimal parameter L
Spectral density estimation
Functional data recovery
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call