Abstract

High-temperature gas chromatography (HTGC) is an analytical technique employed in the petroleum industry for component separation. By incorporating chemometrics, HTGC data can be effectively utilized to predict various properties of crude oil. However, HTGC chromatograms generate a substantial number of variables, some of which may lack pertinent chemical information. Consequently, employing variable selection methods becomes crucial to reduce the number of variables and enhance the predictive capability of calibration models. In this study, the interval partial least squares (iPLS), synergy interval partial least squares (siPLS), and ordered predictors selection (OPS) methods were applied for variable selection to construct linear regression models. The main objective was to investigate the potential of these methods in predicting eight properties of crude oil: American Petroleum Institute (API) gravity, standardized kinematic viscosity at 50 °C (VISp), flash point (FP), Reid vapor pressure (RVP), micro carbon residue (MCR), saturates (SAT), aromatics (ARO), and polar (POL) content. While all variable selection methods yielded satisfactory results, the OPS-PLS regression models consistently exhibited the best performance in estimating these properties, achieving root mean squared error of prediction (RMSEP) values of 1.244 for API, 0.029 for VISp, 15.356 °C for FP, 0.324 kPa for RVP, 0.629 wt% for MCR, 3.691 wt% for SAT, 2.939 wt% for ARO, and 3.374 wt% for POL. Variable selection demonstrated remarkable effectiveness, significantly improving the accuracy of the models, and allowing for the creation of concise models with a focused set of variables.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call