Abstract

Partial least square discriminant analysis (PLS-DA) has achieved a huge success in many research systems, such as quality evaluation of agricultural products and drug analysis. However, it can be found that PLS-DA has various forms which can be mainly divided into two strategies separately called as PLS1-DA and PLS2-DA which is overused. In this work, we proposed a novel strategy not only to select representative samples for modeling, but also to build qualitative models, which is called as principal component analysis Euclidean distance PLS1-DA (PCA-EuD-PLS1-DA). Outliers were detected by the method Leverage. It is very interesting to compare the results of traditional PLS-DA tactics with the proposed method on six datasets. EuD-PLS1-DA with PCA partitioning subsets has the optimal performances than both PLS2-DA and assigned PLS1-DA when confronting multi-class problems. In particular, different classes’ tablets in NIR can be well discriminated and different classes’ tablets in Raman can be discriminated in all EuD-PLS1-DA models; and their prediction accuracy is above 98% and 80%, respectively. Their prediction accuracy for PLS2-DA is above 56% and 26%, respectively. And all RMSECV in EuD-PLS1-DA is smaller than the remaining classifiers; so does RMSEP. This behavior means EuD-PLS1-DA has a better fitness than the remaining methods. When there are only two classes in a dataset, performance of all PLS-DA models is highly similar. Before comparing the results of PLS-DA derivatives, the shortcomings in the other two methods have been systematically described. Sample selection algorithms including K–S, SPXY and PCA were compared with percentage of chosen samples in every class. And the current ratios are mainly in the area of 70–90%. Besides, ratios of every class’ samples selected for modeling are highly similar among K–S, SPXY and PCA. Most importantly, the proposed method can be realized in most statistical software.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.