Abstract
In analytical chemistry spectroscopy is attractive for high-throughput quantification, which often relies on inverse regression, like partial least squares regression. Due to a multivariate nature of spectroscopic measurements an analyte can be quantified in presence of interferences. However, if the model is not fully selective against interferences, analyte predictions may be biased. The degree of model selectivity against an interferent is defined by the inner relation between the regression vector and the pure interfering signal. If the regression vector is orthogonal to the signal, this inner relation equals zero and the model is fully selective. The degree of model selectivity largely depends on calibration data quality. Strong correlations may deteriorate calibration data resulting in poorly selective models. We show this using a fructose-maltose model system. Furthermore, we modify the NIPALS algorithm to improve model selectivity when calibration data are deteriorated. This modification is done by incorporating a projection matrix into the algorithm, which constrains regression vector estimation to the null-space of known interfering signals. This way known interfering signals are handled, while unknown signals are accounted for by latent variables. We test the modified algorithm and compare it to the conventional NIPALS algorithm using both simulated and industrial process data. The industrial process data consist of mid-infrared measurements obtained on mixtures of beta-lactoglobulin (analyte of interest), and alpha-lactalbumin and caseinoglycomacropeptide (interfering species). The root mean squared error of beta-lactoglobulin (% w/w) predictions of a test set was 0.92 and 0.33 when applying the conventional and the modified NIPALS algorithm, respectively. Our modification of the algorithm returns simpler models with improved selectivity and analyte predictions. This paper shows how known interfering signals may be utilized in a direct fashion, while benefitting from a latent variable approach. The modified algorithm can be viewed as a fusion between ordinary least squares regression and partial least squares regression and may be very useful when knowledge of some (but not all) interfering species is available.
Highlights
Vibrational spectroscopic techniques, including mid-infrared spectroscopy (MIRS), are attractive to both academia and industry, compared to traditional wet chemical analysis
Rather than collecting an additional data set of interference, we take advantage of known interfering signals, as suggested by Ferre and Brown [22], and we show how the estimated regression vector can be constrained in the null-space of these known interferents, while latent variable (LV) estimation accounts for unknown interferences
The results in this paper highlight the importance of considering the space spanned by sample measurements when doing inverse regression modeling
Summary
Vibrational spectroscopic techniques, including mid-infrared spectroscopy (MIRS), are attractive to both academia and industry, compared to traditional wet chemical analysis. If a PLS model, predicting an analyte, is not fully selective against interferences, the analyte prediction will depend on the quantities of these interferences This will compromise model validity and robustness [6,7]. We show how the configuration of calibration data affects PLS model selectivity, and we propose a modification of the non-linear iterative partial least squares (NIPALS) algorithm for PLS regression, which ensures model selectivity against known interfering signals. This may result in less complex PLS models with lower prediction error uncertainties and better selectivity
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.