Abstract
Abstract This study employs a Linear Regression-based stacking ensemble learning approach as a novel method to enhance biodiesel conversion efficiency. Initially, a dataset derived from the literature was used to train an ensemble model that combines predictions from Random Forest, XGBoost, and Deep Neural Network (DNN) through a Linear Regression-based fusion approach. This model outperformed individual models (Random Forest: − 0.16, XGBoost: − 0.67, and DNN: 0.36) by achieving an R2 score of 0.45. To further improve model performance, 4900 synthetic data samples were generated and integrated into the dataset. Leveraging the stacking ensemble learning approach with this expanded dataset, the model demonstrated a significant improvement in predictive accuracy, achieving an R2 score of 0.81. This corresponds to an approximate 4% increase in performance compared to individual models (Random Forest: 0.78, XGBoost: 0.78, and DNN: 0.77), highlighting the effectiveness of ensemble learning in optimizing biodiesel conversion efficiency. Additionally, the model exhibited high accuracy with low error rates (MAE: 1.16 and MAPE: 1.24%), effectively compensating for the weaknesses of individual models and providing more stable and generalized predictions. To the best of our knowledge, this is the first study to incorporate a Linear Regression-based stacking method to enhance biodiesel conversion efficiency. These findings underscore the potential of ensemble learning techniques and synthetic data integration in improving renewable fuel efficiency. Future research can further enhance model performance by incorporating larger datasets and exploring more advanced ensemble strategies.
Published Version
Join us for a 30 min session where you can share your feedback and ask us any queries you have