Abstract

A new variable selection method called stability competitive adaptive reweighted sampling (SCARS) is proposed based on competitive adaptive reweighted sampling (CARS). In SCARS, variable is selected by an index of stability that is defined as the absolute value of regression coefficient divided by its standard deviation. SCARS algorithm consists of a number of loops. In each loop, the stability of each variable is computed. Then based on stability, enforced wavelength selection and adaptive reweighted sampling (ARS) is used to select important variables. The selected variables are kept as a variable subset and further used in the next loop. After running the loops, a number of subsets of variables are obtained and root mean squared error of cross validation (RMSECV) of PLS models established with subsets of variables is computed. The subset of variables with the lowest RMSECV is considered as the optimal variable subset. The performance of the proposed algorithm is evaluated by three near-infrared (NIR) datasets: tobacco, corn and glucose datasets. The results show that SCARS can select the least variables and supply the least RMSECV and latent variable number of the PLS model comparing with methods of Moving Window PLS (MWPLS), MCUVE and CARS.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.