A study of machine learning regression methods for major elemental analysis of rocks using laser-induced breakdown spectroscopy

Thomas F Boucher,Marie V Ozanne,Marco L Carmosino,M Darby Dyar,Sridhar Mahadevan,Elly A Breves,Kate H Lepore,Samuel M Clegg

doi:10.1016/j.sab.2015.02.003

Abstract

The ChemCam instrument on the Mars Curiosity rover is generating thousands of LIBS spectra and bringing interest in this technique to public attention. The key to interpreting Mars or any other types of LIBS data are calibrations that relate laboratory standards to unknowns examined in other settings and enable predictions of chemical composition. Here, LIBS spectral data are analyzed using linear regression methods including partial least squares (PLS-1 and PLS-2), principal component regression (PCR), least absolute shrinkage and selection operator (lasso), elastic net, and linear support vector regression (SVR-Lin). These were compared against results from nonlinear regression methods including kernel principal component regression (K-PCR), polynomial kernel support vector regression (SVR-Py) and k-nearest neighbor (kNN) regression to discern the most effective models for interpreting chemical abundances from LIBS spectra of geological samples. The results were evaluated for 100 samples analyzed with 50 laser pulses at each of five locations averaged together. Wilcoxon signed-rank tests were employed to evaluate the statistical significance of differences among the nine models using their predicted residual sum of squares (PRESS) to make comparisons. For MgO, SiO2, Fe2O3, CaO, and MnO, the sparse models outperform all the others except for linear SVR, while for Na2O, K2O, TiO2, and P2O5, the sparse methods produce inferior results, likely because their emission lines in this energy range have lower transition probabilities. The strong performance of the sparse methods in this study suggests that use of dimensionality-reduction techniques as a preprocessing step may improve the performance of the linear models. Nonlinear methods tend to overfit the data and predict less accurately, while the linear methods proved to be more generalizable with better predictive performance. These results are attributed to the high dimensionality of the data (6144 channels) relative to the small number of samples studied. The best-performing models were SVR-Lin for SiO2, MgO, Fe2O3, and Na2O, lasso for Al2O3, elastic net for MnO, and PLS-1 for CaO, TiO2, and K2O. Although these differences in model performance between methods were identified, most of the models produce comparable results when p≤0.05 and all techniques except kNN produced statistically-indistinguishable results. It is likely that a combination of models could be used together to yield a lower total error of prediction, depending on the requirements of the user.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A study of machine learning regression methods for major elemental analysis of rocks using laser-induced breakdown spectroscopy

Abstract

Talk to us

Similar Papers

More From: Spectrochimica Acta Part B: Atomic Spectroscopy

Lead the way for us

Journal: Spectrochimica Acta Part B: Atomic Spectroscopy	Publication Date: Feb 12, 2015
Citations: 151

Similar Papers

Prediction of the concentration of standardized ileal digestible amino acids in distillers dried grains with solubles1
P E Urriola ... G C Shurson
Journal of Animal Science | VOL. 91
P E Urriola, et. al.P E Urriola ... G C Shurson
26 Jul 2013
Journal of Animal Science | VOL. 91

A PRESS statistic for working correlation structure selection in generalized estimating equations
Gul Inan ... John Preisser
Journal of Applied Statistics | VOL. 46
Gul Inan, et. al.Gul Inan ... John Preisser
11 Aug 2018
Journal of Applied Statistics | VOL. 46

PRESS model selection in repeated measures data
Honghu Liu ... Neil S Wenger
Computational Statistics and Data Analysis | VOL. 30
Honghu Liu, et. al.Honghu Liu ... Neil S Wenger
01 Apr 1999
Computational Statistics and Data Analysis | VOL. 30

Calibration in near Infrared Diffuse Reflectance Spectroscopy. A Comparative Study of Various Methods
M Blanco ... J Coello
Journal of Near Infrared Spectroscopy | VOL. 5
M Blanco, et. al.M Blanco ... J Coello
01 Mar 1997
Journal of Near Infrared Spectroscopy | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A study of machine learning regression methods for major elemental analysis of rocks using laser-induced breakdown spectroscopy

Abstract

Talk to us

Similar Papers

More From: Spectrochimica Acta Part B: Atomic Spectroscopy