Robust PCA for high‐dimensional data based on characteristic transformation

Lingyu He, Bo Zhang, Yanrong Yang

doi:10.1111/anzs.12385

Abstract

In this paper, we propose a novel robust principal component analysis (PCA) for high‐dimensional data in the presence of various heterogeneities, in particular heavy tails and outliers. A transformation motivated by the characteristic function is constructed to improve the robustness of classical PCA. In addition to handling the usual outliers, the suggested method has the distinct advantage of handling heavy‐tailed data whose covariances may not exist (they may be infinite, for instance). The proposed approach is also a case of kernel principal component analysis (KPCA), and it obtains its robustness and non‐linearity from a bounded, non‐linear kernel function. The merits of the new method are illustrated by some statistical properties, including an upper bound on the excess error and the behaviour of the large eigenvalues under a spiked covariance model. Additionally, using a variety of simulations, we demonstrate the benefits of our approach over classical PCA. Finally, we apply the novel robust PCA to data on protein expression in mice of various genotypes from a biological study in order to categorise the mice, and we find that our approach is more effective than classical PCA at identifying abnormal mice.
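The abstract does not state the transformation itself, so the following is only a minimal sketch of the general idea it describes: kernel PCA with a bounded kernel applied to heavy-tailed data. The Gaussian kernel, the median-heuristic bandwidth, and the function name `bounded_kernel_pca` are all assumptions made for illustration; they are not taken from the paper and should not be read as the authors' method.

```python
import numpy as np


def bounded_kernel_pca(X, n_components=2, gamma=None):
    """Kernel PCA with a Gaussian kernel, whose values lie in [0, 1].

    Illustrative stand-in only: the kernel and bandwidth rule are
    assumptions, not the transformation proposed in the paper.
    """
    n = X.shape[0]
    # Pairwise squared Euclidean distances (the diagonal is exactly zero).
    diffs = X[:, None, :] - X[None, :, :]
    d2 = np.sum(diffs ** 2, axis=-1)
    if gamma is None:
        # Median heuristic for the bandwidth (an assumption).
        gamma = 1.0 / np.median(d2[d2 > 0])
    # Bounded kernel: an extreme observation cannot dominate the Gram matrix.
    K = np.exp(-gamma * d2)
    # Centre the Gram matrix in feature space.
    J = np.eye(n) - np.ones((n, n)) / n
    Kc = J @ K @ J
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(Kc)
    order = np.argsort(eigvals)[::-1][:n_components]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # Component scores of the training points.
    scores = eigvecs * np.sqrt(np.maximum(eigvals, 0.0))
    return scores, eigvals, K


# Usage on Cauchy data, for which the covariance matrix does not exist.
rng = np.random.default_rng(0)
X_demo = rng.standard_cauchy(size=(60, 10))
demo_scores, demo_eigvals, demo_K = bounded_kernel_pca(X_demo)
print(demo_scores.shape)
```

Because every kernel value is bounded, the Gram matrix and its eigenvalues stay finite even when the sample covariance of the raw data is driven by a few extreme draws, which is the kind of robustness the abstract attributes to a bounded kernel.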

More From: Australian & New Zealand Journal of Statistics

Similar Papers

Robust vs. classical principal component analysis in the presence of outliers
Sunil K Sapra
Applied Economics Letters | VOL. 17
14 Apr 2010

Entropy-based robust PCA for communication network anomaly detection
Duo Liu ... Biswajit Nandy
01 Oct 2014

A robust kernel PCA algorithm
Cong-De Lu ... Can-Ping Li
26 Aug 2004

Improved Statistical Fault Detection Technique and Application to Biological Phenomena Modeled by S-Systems
Majdi Mansouri ... Mohamed N Nounou
IEEE Transactions on NanoBioscience | VOL. 16
12 Jul 2017
