Abstract

The linear relationship between optical absorbance and the concentration of analytes—as postulated by the Beer-Lambert law—is one of the fundamental assumptions that much of the optical spectroscopy literature is explicitly or implicitly based upon. The common use of linear regression models such as principal component regression and partial least squares exemplifies how the linearity assumption is upheld in practical applications. However, the literature also establishes that deviations from the Beer-Lambert law can be expected when (a) the light source is far from monochromatic, (b) the concentrations of analytes are very high and (c) the medium is highly scattering. The lack of a quantitative understanding of when such nonlinearities can become predominant, along with the mainstream use of nonlinear machine learning models in different fields, have given rise to the use of methods such as random forests, support vector regression, and neural networks in spectroscopic applications. This raises the question that, given the small number of samples and the high number of variables in many spectroscopic datasets, are nonlinear effects significant enough to justify the additional model complexity? In the present study, we empirically investigate this question in relation to lactate, an important biomarker. Particularly, to analyze the effects of scattering matrices, three datasets were generated by varying the concentration of lactate in phosphate buffer solution, human serum, and sheep blood. Additionally, the fourth dataset pertained to invivo, transcutaneous spectra obtained from healthy volunteers in an exercise study. Linear and nonlinear models were fitted to each dataset and measures of model performance were compared to attest the assumption of linearity. To isolate the effects of high concentrations, the phosphate buffer solution dataset was augmented with six samples with very high concentrations of lactate between (100–600 mmol/L). Subsequently, three partly overlapping datasets were extracted with lactate concentrations varying between 0–11, 0–20 and 0–600 mmol/L. Similarly, the performance of linear and nonlinear models were compared in each dataset. This analysis did not provide any evidence of substantial nonlinearities due high concentrations. However, the results suggest that nonlinearities may be present in scattering media, justifying the use of complex, nonlinear models.

Highlights

  • The linear relationship between optical absorbance and the concentration of analytes—as postulated by the Beer-Lambert law—is one of the fundamental assumptions that much of the optical spectroscopy literature is explicitly or implicitly based upon

  • Minor improvements in predictive performance might be expected from Partial Least Squares (PLS), when noise constitutes much of the variance in the independent variables ­space[5,6,7]

  • The performance of seven linear and nonlinear models are compared to investigate the extent of nonlinearities caused by high concentrations and scattering matrices

Read more

Summary

Introduction

The linear relationship between optical absorbance and the concentration of analytes—as postulated by the Beer-Lambert law—is one of the fundamental assumptions that much of the optical spectroscopy literature is explicitly or implicitly based upon. Near Infared (NIR), Mid-Infrared (mid-IR) visible and Ultraviolet (UV) optical spectroscopy provide a low-cost and non-invasive alternative to electrode-based approaches in the characterization of chemical compounds and the quantification of analytes Such applications necessitate training predictive models on datasets with sufficient variations in the concentration of absorbing species. The presence of redundant variables and the multicollinearity of absorbance values mean the resulting covariance matrices are low rank, under uniform attenuation conditions, the Beer-Lambert law postulates a linear relationship between the absorbance of monochromatic light and the concentration of absorbing species, N. i=1 where A denotes absorbance, Io and I are the intensity of beam before and after passing through the absorbing layer, N is the number of absorbing species in the matrix, ǫi is the molar decadic extinction coefficient for the ­ith absorbing species, ci is the concentration, and l is the path length of light. Minor improvements in predictive performance might be expected from PLS, when noise constitutes much of the variance in the independent variables ­space[5,6,7]

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.