Abstract

Principal component analysis (PCA) or singular value decomposition (SVD) are multivariate techniques that are often used to compress large data matrices to a relevant size. Subsequent data analysis then proceeds with the model representation of the data. In this first paper expressions are derived for the prediction of the bias in the eigenvalues of PCA and singular values of SVD that results from random measurement errors in the data. Theoretical expressions for the prediction of this “random error bias” have been given in the statistics literature. These results are, however, restricted to the case that only one principal component (PC) is significant. The first objective of this paper is to extend these results to an arbitrary number of significant PCs. For the generalization Malinowski's error functions are used. A signal-to-noise ratio is defined that describes the error situation for each individual PC. This definition enhances the interpretability of the derived expressions. The adequacy of the derived expressions is tested by a limited Monte Carlo study. This finally leads to the second objective of this paper. Simulation results are always restricted to the class of data that is well represented in the study. Thus rather than giving extensive simulation results it is outlined how the validation and evaluation of theoretical predictions can proceed for a specific application in practice.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.