Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical aspects

José Camacho,Alberto Ferrer

doi:10.1016/j.chemolab.2013.12.003

Abstract

This is the second paper of a series devoted to provide theoretical and practical results and new algorithms for the selection of the number of Principal Components (PCs) in Principal Component Analysis (PCA) using cross-validation. The study is especially focused on the element-wise k-fold (ekf), which is among the most used algorithms for that purpose. In this paper, a taxonomy of PCA applications is proposed and it is argued that cross-validatory algorithms computing the prediction error in observable variables, like ekf, are only suited for a class of applications. A number of cross-validation methods, several of which are original, are compared in two applications of this class: missing data imputation and compression. The results show that the ekf is especially suited for missing data applications while other traditional cross-validation methods, those by Wold and Eastment and Krzanowski, are not found to provide useful outcomes in any of the two applications. These results are of special value considering that the methods investigated are computed in the main commercial software packets for chemometrics. Finally, the choice of the missing data algorithm within ekf is also investigated.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Chemometrics and Intelligent Laboratory Systems	Publication Date: Dec 27, 2013
Citations: 42	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical aspects

Abstract

Talk to us

Similar Papers

More From: Chemometrics and Intelligent Laboratory Systems

Lead the way for us

Similar Papers

Assessment of maximum likelihood PCA missing data imputation
Abel Folch‐Fortuny ... Francisco Arteaga
Journal of Chemometrics | VOL. 30
Abel Folch‐Fortuny, et. al.Abel Folch‐Fortuny ... Francisco Arteaga
08 Jun 2016
Journal of Chemometrics | VOL. 30

Missing Data Imputation Toolbox for MATLAB
Abel Folch-Fortuny ... Alberto Ferrer
Chemometrics and Intelligent Laboratory Systems | VOL. 154
Abel Folch-Fortuny, et. al.Abel Folch-Fortuny ... Alberto Ferrer
25 Mar 2016
Chemometrics and Intelligent Laboratory Systems | VOL. 154

Author response: Limitations of principal components in quantitative genetic association models for human studies
Yiqi Yao ... Alejandro Ochoa
-
Yiqi Yao, et. al.Yiqi Yao ... Alejandro Ochoa
25 Apr 2023
25 Apr 2023

Decision letter: Limitations of principal components in quantitative genetic association models for human studies
Magnus Nordborg ... Detlef Weigel
-
Magnus Nordborg, et. al.Magnus Nordborg ... Detlef Weigel
04 Jul 2022
04 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical aspects

Abstract

Talk to us

Similar Papers

More From: Chemometrics and Intelligent Laboratory Systems