Abstract

This work aims to develop an auto-machine learning method using Mid-Infrared (MIR) spectroscopy data to determine the cold filter plugging point (CFPP) and kinematic viscosity at 40 ºC of biodiesel, diesel, and mixtures samples. The biodiesel was obtained by the transesterification reaction and later purified. The first dataset was composed of 108 blends (biodiesel obtained from different biomass such as soy, corn, sunflower, and canola) with binary, ternary and quaternary mixtures. The second dataset was composed of 227 blends of diesel-biodiesel and diesel-biodiesel-ethanol, respectively. The physical properties of the samples were obtained according to ABNT NBR 14747 and ABNT NBR 10441, respectively. The MIR Spectroscopy data were acquired from 7,800 to 450 cm-1, with a 4 cm-1 resolution and 20 scans. The spectra' baseline alignment was carried out using the asymmetric least squares method. A Savitzky–Golay filter was applied to a set of digital data points to smooth the data. This work used a first-order polynomial and a zero derivative function to smooth the spectra. The dataset was split into training and test sets using the function CreateDataPartition from the caret package. It was adopted 70% for training and 30% for test sets. In this work, the model training process was carried out using the open-source Python library LazyPredict. The LazyPredict returns the trained models and their performance metrics. The kinematic viscosity at 40 ºC of the biodiesel samples and their blends could be modeled using the MIR Spectroscopy dataset using different auto-machine learning algorithms. The RMSEP (Root Mean Square Error of Prediction) (≤ 0.02 mm2 s-1) was similar to the experimental error obtained after log transformation. The CFPP of the biodiesel samples and their blends could be modeled using the MIR Spectroscopy dataset by different auto-machine learning algorithms with an RMSEP (≤ 1.6 ºC) similar to the experimental error obtained by traditional methodology. Based on the lower computational time and the same performance observed by the RMSEP and R2 (coefficient of determination) values from different algorithms, it is recommended to use Ridge or Ridge Cross-Validation Regression models for both physical properties using MIR Spectroscopy data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call