Functional principal subspace sampling for large scale functional data analysis

Shiyuan He,Xiaomeng Yan

doi:10.1214/22-ejs2010

Abstract

Functional data analysis (FDA) methods have computational and theoretical appeals for some high dimensional data, but lack the scalability to modern large sample datasets. To tackle the challenge, we develop randomized algorithms for two important FDA methods: functional principal component analysis (FPCA) and functional linear regression (FLR) with scalar response. The two methods are connected as they both rely on the accurate estimation of functional principal subspace. The proposed algorithms draw subsamples from the large dataset at hand and apply FPCA or FLR over the subsamples to reduce the computational cost. To effectively preserve subspace information in the subsamples, we propose a functional principal subspace sampling probability, which removes the eigenvalue scale effect inside the functional principal subspace and properly weights the residual. Based on the operator perturbation analysis, we show the proposed probability has precise control over the first order error of the subspace projection operator and can be interpreted as an importance sampling for functional subspace estimation. Moreover, concentration bounds for the proposed algorithms are established to reflect the low intrinsic dimension nature of functional data in an infinite dimensional space. The effectiveness of the proposed algorithms is demonstrated upon synthetic and real datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Journal of Statistics	Publication Date: Jan 1, 2022
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Functional principal subspace sampling for large scale functional data analysis

Abstract

Talk to us

Similar Papers

More From: Electronic Journal of Statistics

Lead the way for us

Similar Papers

A data-driven soft-sensing approach using probabilistic latent variable model with functional data framework
Xiaoying Tan ... Ranran Liu
Transactions of the Institute of Measurement and Control | VOL. 46
Xiaoying Tan, et. al.Xiaoying Tan ... Ranran Liu
10 Aug 2023
Transactions of the Institute of Measurement and Control | VOL. 46

Functional data analysis of sleeping energy expenditure.
Jong Soo Lee ... Nancy F Butte
PloS one | VOL. 12
Jong Soo Lee, et. al.Jong Soo Lee ... Nancy F Butte
10 May 2017
PloS one | VOL. 12

Applying functional data analysis to assess tele-interpersonal psychotherapy's efficacy to reduce depression
Henok Woldu ... Ye Shen
Journal of Applied Statistics | VOL. 46
Henok Woldu, et. al.Henok Woldu ... Ye Shen
04 May 2018
Journal of Applied Statistics | VOL. 46

Modeling regional impacts of climate teleconnections using functional data analysis
Simon J Bonner ... Nancy E Heckman
Environmental and Ecological Statistics | VOL. 21
Simon J Bonner, et. al.Simon J Bonner ... Nancy E Heckman
19 Apr 2013
Environmental and Ecological Statistics | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Functional principal subspace sampling for large scale functional data analysis

Abstract

Talk to us

Similar Papers

More From: Electronic Journal of Statistics