Sparse generalized principal component analysis for large-scale applications beyond Gaussianity

Qiaoya Zhang, Yiyuan She

doi:10.4310/sii.2016.v9.n4.a11

Abstract

Principal Component Analysis (PCA) is a dimension reduction technique. It produces inconsistent estimators when the dimensionality is moderate to high, a situation common in modern large-scale applications, where algorithm scalability and model interpretability are also difficult to achieve and missing values are prevalent. While existing sparse PCA methods alleviate the inconsistency, they remain tied to the Gaussian assumption of classical PCA and do not address algorithm scalability. We generalize sparse PCA to the broad class of exponential family distributions in the high-dimensional setting, with built-in treatment of missing values. We also propose a family of iterative sparse generalized PCA (SG-PCA) algorithms for which, despite the non-convexity and non-smoothness of the optimization task, the loss function decreases in every iteration. Our sparsity-inducing regularization is far superior to the popular Lasso in the ease and intuitiveness of parameter tuning. Furthermore, to promote overall scalability, accelerated gradient is integrated for fast convergence, while a progressive screening technique gradually squeezes out nuisance dimensions of a large-scale problem to make optimization feasible. High-dimensional simulations and real-data experiments demonstrate the efficiency and efficacy of SG-PCA.
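To make the ingredients named in the abstract concrete, the following is a minimal illustrative sketch and not the authors' SG-PCA algorithm. It assumes a rank-one model, Bernoulli (binary) data as the exponential family member, missing entries coded as NaN and simply excluded from the loss, and a hard-thresholding step that keeps the q largest loadings as a stand-in for the paper's sparsity-inducing regularization, whose exact form the abstract does not state. Acceleration and progressive screening are omitted. All function names are hypothetical.

```python
import numpy as np


def sigmoid(t):
    # Numerically stable logistic function via the tanh identity.
    return 0.5 * (1.0 + np.tanh(0.5 * t))


def hard_threshold(v, q):
    # Keep the q largest-magnitude entries of v, zero the rest (assumes q >= 1).
    out = np.zeros_like(v)
    keep = np.argsort(np.abs(v))[-q:]
    out[keep] = v[keep]
    return out


def masked_bernoulli_loss(X0, M, Theta):
    # Bernoulli negative log-likelihood summed over observed entries only.
    return float(np.sum(M * (np.logaddexp(0.0, Theta) - X0 * Theta)))


def sparse_logistic_pca_rank1(X, q, n_iter=50, seed=0):
    """Rank-one sketch: natural parameters Theta = u v^T, v has at most q nonzeros.

    X is a binary matrix with NaN marking missing values.
    """
    rng = np.random.default_rng(seed)
    M = ~np.isnan(X)                 # True where the entry is observed
    X0 = np.where(M, X, 0.0)         # fill missing entries; they are masked out
    n, p = X0.shape
    u = rng.normal(size=n)
    v = hard_threshold(rng.normal(size=p), q)   # start from a feasible sparse v
    losses = [masked_bernoulli_loss(X0, M, np.outer(u, v))]
    for _ in range(n_iter):
        # Score update: gradient step with step size 1/L, where
        # L = ||v||^2 / 4 bounds the curvature of the logistic loss in u.
        G = M * (sigmoid(np.outer(u, v)) - X0)
        u = u - (G @ v) / (0.25 * (v @ v) + 1e-12)
        # Loading update: gradient step with step size 1/L (L = ||u||^2 / 4),
        # followed by hard thresholding to enforce the cardinality bound.
        G = M * (sigmoid(np.outer(u, v)) - X0)
        v = hard_threshold(v - (G.T @ u) / (0.25 * (u @ u) + 1e-12), q)
        losses.append(masked_bernoulli_loss(X0, M, np.outer(u, v)))
    return u, v, losses
```

Because each block update minimizes a quadratic upper bound of the loss over its feasible set, the recorded `losses` should be non-increasing in this sketch, which mirrors (in a much simpler setting) the monotone-decrease property the abstract claims for SG-PCA. The single tuning parameter `q` is a direct count of nonzero loadings, which is one way such a constraint can be easier to set than a Lasso penalty weight.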


Journal: Statistics and Its Interface	Publication Date: Jan 1, 2016
Citations: 1

Similar Papers

Author response: Sparse dimensionality reduction approaches in Mendelian randomisation with highly correlated exposures
Vasileios Karageorgiou ... Verena Zuber
28 Nov 2022

Sparse group principal component analysis using elastic-net regularisation and its application to virtual metrology in semiconductor manufacturing
Geonseok Lee ... Myong-Kee Jeong
International Journal of Production Research | VOL. ahead-of-print
07 Jun 2024

Sparse Variable PCA Using Geodesic Steepest Descent
M.O Ulfarsson ... V Solo
IEEE Transactions on Signal Processing | VOL. 56
01 Dec 2008

Sparse Logistic Principal Components Analysis for Binary Data
Seokho Lee ... Jianhua Z Huang
19 Jun 2015
