Abstract

While both Tucker3 and PARAFAC models can be viewed as latent variable models extending principal component analysis (PCA) to multi-way data, most fundamental properties of PCA do not extend to both models. This has practical importance, which will be explained in this paper. The fundamental difference between the PARAFAC and the Tucker3 model can be viewed as the difference between so-called low-rank and subspace approximation of the data. This insight is used to pose a modification of the multi-linear partial least squares regression (N-PLS) model. The modification is found by exploiting the basic properties of PLS and of multi-way models. Compared to the current prevalent implementation of N-PLS, the new model provides a more reasonable fit to the independent data and exactly the same predictions of the dependent variables. Thus, the reason for introducing this improved model is not to obtain better predictions, but rather the aim is to improve the secondary aspect of PLS: the modeling of the independent variables. The original version of N-PLS has some built-in problems that are easily circumvented with the modification suggested here. This is of importance, for example, in process monitoring, outlier detection and also, implicitly, for jackknifing of model parameters. Some examples are provided to illustrate some of these points.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.