Abstract
Sparse mathematical modelling plays an increasingly important role in chemometrics because of its interpretability and predictive power. While many sparse techniques used in chemometrics rely on L1 penalization to produce sparser models, Mixed Integer Optimization (MIO) achieves sparsity by imposing constraints directly in the model. In this paper, we develop an intuitive and flexible robust sparse regression framework using MIO, in which constraints provide sparsity and penalization provides robustness. We compare its results with those of other techniques that generate sparse models, namely LASSO and sparse PLS (SPLS), and we use PLS as a baseline for predictive performance. We illustrate the framework with different objective functions on a laser-induced breakdown spectroscopy (LIBS) data set of certified reference materials (CRM) of various mineral ores. The proposed MIO framework improves accuracy, sparsity and robustness relative to LASSO and SPLS. MIO achieves an average R² at least 10.6% higher than that of the other methods. The robust MIO approach also improves interpretability: it uses 4.3 variables on average, whereas LASSO and SPLS use 16.1 and 805.8, respectively. We also illustrate how interpretability can help build better models through examples derived from the data sets used. When noise is added to the signal, MIO achieves an average R² of 0.69, whereas all other models have negative R² values. The proposed MIO framework is versatile and could be used in other chemometrics applications.
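To make the contrast between penalization and constraints concrete, the following is a generic textbook sketch rather than the exact formulation used in this paper. The symbols are assumptions introduced for illustration: X is the matrix of spectra, y the response vector, β the regression coefficients, λ the LASSO penalty weight, k a user-chosen limit on the number of selected variables, z_j a binary indicator for variable j, and M a sufficiently large bound on the size of any coefficient. The robustness penalty mentioned in the abstract is not shown.

```latex
% LASSO: sparsity is encouraged indirectly through an L1 penalty
\min_{\beta \in \mathbb{R}^{p}} \; \lVert y - X\beta \rVert_2^2 + \lambda \lVert \beta \rVert_1

% Cardinality-constrained regression (MIO): sparsity is imposed directly
\begin{aligned}
\min_{\beta,\, z} \quad & \lVert y - X\beta \rVert_2^2 \\
\text{s.t.} \quad & -M z_j \le \beta_j \le M z_j, \qquad j = 1, \dots, p, \\
& \sum_{j=1}^{p} z_j \le k, \\
& z_j \in \{0, 1\}, \qquad j = 1, \dots, p.
\end{aligned}
```

In this sketch, setting z_j = 0 forces β_j = 0, so at most k variables enter the model. With LASSO, by contrast, the number of selected variables is controlled only indirectly, by tuning λ.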