Variable selection in support vector regression using angular search algorithm and variance inflation factor

Gabriely S Folli,Márcia H.C Nascimento,Ellisson H De Paulo,Pedro H.P Da Cunha,Paulo R Filgueiras,Wanderson Romão

doi:10.1002/cem.3282

Abstract

AbstractHere, we combine angular search algorithm and variance inflation factor (ASA‐VIF) with support vector regression (SVR) (ASA‐VIF‐SVR) to estimate total acid number (TAN), basic nitrogen content (BNC), and sulfur content (SC) in Brazilian crude oils. To prevent the interference of outliers, we further developed a strategy for outlier identification and applied it to nonlinear models based on RMSE (root mean square error). ASA‐VIF‐SVR was applied to near‐ and mid‐infrared spectroscopy (NIR and MIR) and hydrogen nuclear magnetic resonance (1H NMR) spectroscopy data available in a range of 93–194 samples. The models were evaluated for accuracy (root mean square error of calibration [RMSEC] and root mean square error of prediction [RMSEP]) and linearity (coefficient of determination, R2). The removal of outliers increased accuracy and linearity of our models. The ASA‐VIF model for TAN, BNC, and SC selected 0.37%, 0.93%, and 0.30% of variables from full NIR spectra; 0.21%, 0.27%, and 0.21% from full MIR; and 0.20%, 0.42%, and 0.15% from full 1H NMR. In most cases, the best results were obtained with variable selection compared with the full dataset. Also, 1H NMR generated more accurate and linear models with RMSEP and R2p of 0.0071 wt% and 0.86 for BNC and 0.0623 wt% and 0.79 for SC. TAN showed a better MIR result with RMSEP of 0.1426 mg KOH g–1 and R2p of 0.47. The most important region for 1H NMR and MIR was the one with the largest quantity of unpaired electrons (aromatic region).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Variable selection in support vector regression using angular search algorithm and variance inflation factor

Abstract

Talk to us

Similar Papers

More From: Journal of Chemometrics

Lead the way for us

Journal: Journal of Chemometrics	Publication Date: Jul 18, 2020
Citations: 17

Similar Papers

Liquid-liquid microextraction method using a deep eutectic solvent for the determination of total paraben content by fluorescence spectroscopy and second-order calibration
Daniella Iris Oliveira Silva ... Márcio José Coelho Pontes
Microchemical Journal | VOL. 193
Daniella Iris Oliveira Silva, et. al.Daniella Iris Oliveira Silva ... Márcio José Coelho Pontes
25 Jul 2023
Microchemical Journal | VOL. 193

Four chemometric models enhanced by Latin hypercube sampling design for quantification of anti-COVID drugs: sustainability profiling through multiple greenness, carbon footprint, blueness, and whiteness metrics
Noha S Katamesh ... Shimaa A Mahmoud
BMC Chemistry | VOL. 18
Noha S Katamesh, et. al.Noha S Katamesh ... Shimaa A Mahmoud
18 Mar 2024
BMC Chemistry | VOL. 18

Comparison between partial least square and support vector regression with a genetic algorithm wavelength selection method for the simultaneous determination of some oxygenate compounds in gasoline by FTIR spectroscopy
Ahmad Asghari ... Amir Bagheri Garmarudi
Infrared Physics & Technology | VOL. 105
Ahmad Asghari, et. al.Ahmad Asghari ... Amir Bagheri Garmarudi
28 Dec 2019
Infrared Physics & Technology | VOL. 105

Rapid determination of benzalkonium chloride in aqueous samples by FTIR spectroscopy in tandem with chemometrics
Ahmad Asghari ... Sina Darzi
Infrared Physics & Technology | VOL. 116
Ahmad Asghari, et. al.Ahmad Asghari ... Sina Darzi
28 Apr 2021
Infrared Physics & Technology | VOL. 116

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Variable selection in support vector regression using angular search algorithm and variance inflation factor

Abstract

Talk to us

Similar Papers

More From: Journal of Chemometrics