Abstract

In laser-induced breakdown spectroscopy (LIBS), identifying key emission lines for accurate elemental quantification has long posed a challenge. Traditional methods rely on experimental knowledge, atomic databases, and intricate spectral analyses. Although machine learning techniques – such as boosting algorithms and neural networks – offer efficient processing for large datasets, the complexity of these techniques often compromises interpretability. To address this issue, our study integrates the SHapley Additive exPlanations (SHAP) algorithm with gradient boosting models in order to interpret the most important spectral features, thus enhancing our understanding of how specific emission lines contribute to the carbon (C) concentration predictions in soils. Deployed on a large dataset of 1019 soil samples, a wrapper method with a random forest regressor reduced the initial spectral intensity features from 13,748 to 1098. Subsequent application of a LightGBM regression model calibrated via the Optuna framework yielded R2 of 0.98 and 0.77, and RMSE values of 1.55 and 4.54 g kg−1 for training and validation sets, respectively. The SHAP summary plot showed that C emission lines influenced the model's predictions positively, as anticipated, whereas silicon (Si) emission lines produced a negative impact, suggesting a lower C concentration in sandy soils. Our findings not only validate the efficacy of SHAP in improving LIBS-based soil C quantification, but they also offer a sophisticated framework for decoding the complex interplay between emission lines and target elemental concentrations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.