Machine learning framework to predict product distribution of lignocellulosic biomass pyrolysis.

  • Abstract
  • References
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

Machine learning framework to predict product distribution of lignocellulosic biomass pyrolysis.

ReferencesShowing 10 of 55 papers
  • Cite Count Icon 1757
  • 10.1016/j.rser.2015.12.185
Lignocellulosic biomass pyrolysis: A review of product properties and effects of pyrolysis parameters
  • Jan 8, 2016
  • Renewable and Sustainable Energy Reviews
  • Tao Kan + 2 more

  • Cite Count Icon 14
  • 10.1016/j.jclepro.2023.137472
Production of bio-oil from lychee-based biomass through pyrolysis and maximization of bio-oil yield with statistical and machine learning techniques
  • May 12, 2023
  • Journal of Cleaner Production
  • Yashvir Singh + 8 more

  • Cite Count Icon 11
  • 10.1016/j.cherd.2018.04.021
A Multigene Genetic Programming approach for modeling effect of particle size in a liquid–solid circulating fluidized bed reactor
  • Apr 25, 2018
  • Chemical Engineering Research and Design
  • Shaikh A Razzak + 4 more

  • Cite Count Icon 14
  • 10.1016/j.jaap.2022.105546
Estimation of pyrolysis liquid product yield and its hydrogen content for biomass resources by combined evaluation of pyrolysis conditions with proximate-ultimate analysis data: A machine learning application
  • May 12, 2022
  • Journal of Analytical and Applied Pyrolysis
  • Şeyda Taşar

  • Cite Count Icon 344
  • 10.1016/j.ces.2007.11.024
Nonlinear parameter estimation through particle swarm optimization
  • Nov 19, 2007
  • Chemical Engineering Science
  • Marcio Schwaab + 3 more

  • Cite Count Icon 5
  • 10.1021/acs.cgd.3c01027
Statistical Analyses of a Population Balance Model of a Batch Crystallization Process
  • Dec 12, 2023
  • Crystal Growth & Design
  • Fernando Arrais Romero Dias Lima + 5 more

  • Open Access Icon
  • Cite Count Icon 101
  • 10.1021/acssuschemeng.6b03098
Mathematical Modeling of Fast Biomass Pyrolysis and Bio-Oil Formation. Note II: Secondary Gas-Phase Reactions and Bio-Oil Formation
  • Mar 3, 2017
  • ACS Sustainable Chemistry & Engineering
  • Eliseo Ranzi + 2 more

  • Open Access Icon
  • Cite Count Icon 197
  • 10.1021/acssuschemeng.6b03096
Mathematical Modeling of Fast Biomass Pyrolysis and Bio-Oil Formation. Note I: Kinetic Mechanism of Biomass Pyrolysis
  • Mar 2, 2017
  • ACS Sustainable Chemistry & Engineering
  • Eliseo Ranzi + 2 more

  • Cite Count Icon 405
  • 10.1002/cjce.5450670111
Kinetic modelling of the pyrolysis of biomass and biomass components
  • Feb 1, 1989
  • The Canadian Journal of Chemical Engineering
  • C A Koufopanos + 2 more

  • Open Access Icon
  • Cite Count Icon 39
  • 10.1016/j.cej.2016.09.135
Microkinetic analysis of ethanol to 1,3-butadiene reactions over MgO-SiO2 catalysts based on characterization of experimental fluctuations
  • Sep 29, 2016
  • Chemical Engineering Journal
  • Simoní Da Ros + 7 more

Similar Papers
  • Research Article
  • Cite Count Icon 16
  • 10.1109/jbhi.2019.2961808
Models of Parkinson's Disease Patient Gait
  • Nov 1, 2020
  • IEEE Journal of Biomedical and Health Informatics
  • James Alexander Hughes + 2 more

Parkinson's Disease is a disorder with diagnostic symptoms that include a change to a walking gait. The disease is problematic to diagnose. An objective method of monitoring the gait of a patient is required to ensure the effectiveness of diagnosis and treatments. We examine the suitability of Extreme Gradient Boosting (XGBoost) and Artificial Neural Network (ANN) Models compared to Symbolic Regression (SR) using genetic programming that was demonstrated to be successful in previous works on gait. The XGBoost and ANN models are found to out-perform SR, but the SR model is more human explainable.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 9
  • 10.3390/en15239008
An Effective Hybrid Symbolic Regression–Deep Multilayer Perceptron Technique for PV Power Forecasting
  • Nov 28, 2022
  • Energies
  • Mohamed Trabelsi + 6 more

The integration of Photovoltaic (PV) systems requires the implementation of potential PV power forecasting techniques to deal with the high intermittency of weather parameters. In the PV power prediction process, Genetic Programming (GP) based on the Symbolic Regression (SR) model has a widespread deployment since it provides an effective solution for nonlinear problems. However, during the training process, SR models might miss optimal solutions due to the large search space for the leaf generations. This paper proposes a novel hybrid model that combines SR and Deep Multi-Layer Perceptron (MLP) for one-month-ahead PV power forecasting. A case study analysis using a real Australian weather dataset was conducted, where the employed input features were the solar irradiation and the historical PV power data. The main contribution of the proposed hybrid SR-MLP algorithm are as follows: (1) The training speed was significantly improved by eliminating unimportant inputs during the feature selection process performed by the Extreme Boosting and Elastic Net techniques; (2) The hyperparameters were preserved throughout the training and testing phases; (3) The proposed hybrid model made use of a reduced number of layers and neurons while guaranteeing a high forecasting accuracy; (4) The number of iterations due to the use of SR was reduced. The presented simulation results demonstrate the higher forecasting accuracy (reductions of more than 20% for Root Mean Square Error (RMSE) and 30 % for Mean Absolute Error (MAE) in addition to an improvement in the R2 evaluation metric) and robustness (preventing the SR from converging to local minima with the help of the ANN branch) of the proposed SR-MLP model as compared to individual SR and MLP models.

  • Research Article
  • Cite Count Icon 15
  • 10.1016/j.watres.2021.117965
A CFD-ML augmented alternative to residence time for clarification basin scaling and design
  • Dec 15, 2021
  • Water Research
  • Haochen Li + 1 more

A CFD-ML augmented alternative to residence time for clarification basin scaling and design

  • Research Article
  • Cite Count Icon 5
  • 10.1016/j.foodchem.2012.02.053
Calibration of artificial neural network and partial least squares regression models for the prediction of secoisolariciresinol diglucoside contents in microwave-assisted extracts of various flaxseed (Linum usitatissimum L.) samples
  • Feb 16, 2012
  • Food Chemistry
  • Simona M Nemes + 2 more

Calibration of artificial neural network and partial least squares regression models for the prediction of secoisolariciresinol diglucoside contents in microwave-assisted extracts of various flaxseed (Linum usitatissimum L.) samples

  • Research Article
  • Cite Count Icon 21
  • 10.1186/s12911-021-01667-8
Application of machine learning to predict the occurrence of arrhythmia after acute myocardial infarction
  • Nov 2, 2021
  • BMC Medical Informatics and Decision Making
  • Suhuai Wang + 6 more

BackgroundEarly identification of the occurrence of arrhythmia in patients with acute myocardial infarction plays an essential role in clinical decision-making. The present study attempted to use machine learning (ML) methods to build predictive models of arrhythmia after acute myocardial infarction (AMI).MethodsA total of 2084 patients with acute myocardial infarction were enrolled in this study. (All data is available on Github: https://github.com/wangsuhuai/AMI-database1.git). The primary outcome is whether tachyarrhythmia occurred during admission containing atrial arrhythmia, ventricular arrhythmia, and supraventricular tachycardia. All data is randomly divided into a training set (80%) and an internal testing set (20%). Apply three machine learning algorithms: decision tree, random forest (RF), and artificial neural network (ANN) to learn the training set to build a model, then use the testing set to evaluate the prediction performance, and compare it with the model built by the Global Registry of Acute Coronary Events (GRACE) risk variable set.ResultsThree ML models predict the occurrence of tachyarrhythmias after AMI. After variable selection, the artificial neural network (ANN) model has reached the highest accuracy rate, which is better than the model constructed using the Grace variable set. After applying SHapley Additive exPlanations (SHAP) to make the model interpretable, the most important features are abnormal wall motion, lesion location, bundle branch block, age, and heart rate. Among them, RBBB (odds ratio [OR]: 4.21; 95% confidence interval [CI]: 2.42–7.02), ≥ 2 ventricular walls motion abnormal (OR: 3.26; 95% CI: 2.01–4.36) and right coronary artery occlusion (OR: 3.00; 95% CI: 1.98–4.56) are significant factors related to arrhythmia after AMI.ConclusionsWe used advanced machine learning methods to build prediction models for tachyarrhythmia after AMI for the first time (especially the ANN model that has the best performance). The current study can supplement the current AMI risk score, provide a reliable evaluation method for the clinic, and broaden the new horizons of ML and clinical research.Trial registration Clinical Trial Registry No.: ChiCTR2100041960.

  • Research Article
  • Cite Count Icon 15
  • 10.1093/annweh/wxaa097
Artificial Neural Networks (ANNs) and Partial Least Squares (PLS) Regression in the Quantitative Analysis of Respirable Crystalline Silica by Fourier-Transform Infrared Spectroscopy (FTIR).
  • Oct 23, 2020
  • Annals of Work Exposures and Health
  • Mina Salehi + 2 more

Respirable crystalline silica (RCS) overexposure can lead to the development of silicosis which is a chronic, irreversible, potentially fatal respiratory disease. The most significant prerequisite for any silica exposure control plan is an accurate occupational exposure assessment. The results of crystalline silica analysis are often affected by other mineral interferences and are influenced by an analyst's knowledge of mineralogy to accurately interpret infrared spectra and correct matrix interferences. Partial least squares (PLS) and artificial neural networks (ANNs) are two multivariate calibration methods to overcome the problem of spectral interferences without the need for an analyst intervention. The performance of these two methods in quantitative analysis of quartz in the presence of mineral interferences was evaluated and compared in this study. Fifty mixtures with different crystalline silica content ratios were prepared by mixing quartz with four common mineral interferences including kaolinite, albite, muscovite, and amorphous silica. Fourier-transform infrared spectra of the mixtures were split into training and test datasets. The optimal architecture of the ANN model was achieved using a two-level full factorial design experiment and data were modeled using ANN and PLS regression analysis. Root mean squared error of prediction values of 1.69 and 6.12 µg quartz for ANN and PLS models, respectively, revealed the fact that the both models performed very well in quantitative analysis of quartz in the presence of mineral interferences, with a better relative performance of the ANN model which can be related to the inherent nonlinear predictive ability of ANNs. Given the excellent predictive ability of the ANN model which can deal with a completely overlapped peak without any need of user's intervention, it is recommended that the ANN model be optimized in future studies and utilized for reliable and rapid on-field assessment of RCS exposure.

  • Conference Article
  • Cite Count Icon 4
  • 10.1109/cec45853.2021.9504767
GP with a Hybrid Tree-vector Representation for Instance Selection and Symbolic Regression on Incomplete Data
  • Jun 28, 2021
  • Baligh Al-Helali + 3 more

Data incompleteness is a pervasive problem in symbolic regression, and machine learning in general. Unfortunately, most symbolic regression methods are only applicable when the given data is complete. One common approach to handling this situation is data imputation. It works by estimating missing values based on existing data. However, which existing data should be used for imputing the missing values? The answer to this question is important when dealing with incomplete data. To address this question, this work proposes a mixed tree-vector representation for genetic programming to perform instance selection and symbolic regression on incomplete data. In this representation, each individual has two components: an expression tree and a bit vector. While the tree component constructs symbolic regression models, the vector component selects the instances that are used to impute missing values by the weighted k-nearest neighbour (WKNN) imputation method. The complete imputed instances are then used to evaluate the GP-based symbolic regression model. The obtained experimental results show the applicability of the proposed method on real-world data sets with different missingness scenarios. When compared with existing methods, the proposed method not only produces more effective symbolic regression models but also achieves more efficient imputations.

  • Research Article
  • Cite Count Icon 16
  • 10.1186/s10033-023-00876-8
Crack Growth Rate Model Derived from Domain Knowledge-Guided Symbolic Regression
  • Mar 27, 2023
  • Chinese Journal of Mechanical Engineering
  • Shuwei Zhou + 4 more

Machine learning (ML) has powerful nonlinear processing and multivariate learning capabilities, so it has been widely utilised in the fatigue field. However, most ML methods are inexplicable black-box models that are difficult to apply in engineering practice. Symbolic regression (SR) is an interpretable machine learning method for determining the optimal fitting equation for datasets. In this study, domain knowledge-guided SR was used to determine a new fatigue crack growth (FCG) rate model. Three terms of the variable subtree of ΔK, R-ratio, and ΔKth were obtained by analysing eight traditional semi-empirical FCG rate models. Based on the FCG rate test data from other literature, the SR model was constructed using Al-7055-T7511. It was subsequently extended to other alloys (Ti-10V-2Fe-3Al, Ti-6Al-4V, Cr-Mo-V, LC9cs, Al-6013-T651, and Al-2324-T3) using multiple linear regression. Compared with the three semi-empirical FCG rate models, the SR model yielded higher prediction accuracy. This result demonstrates the potential of domain knowledge-guided SR for building the FCG rate model.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 11
  • 10.1038/s41598-024-64386-w
Prediction and reliability analysis of shear strength of RC deep beams
  • Jun 25, 2024
  • Scientific Reports
  • Khaled Megahed

This study explores machine learning (ML) capabilities for predicting the shear strength of reinforced concrete deep beams (RCDBs). For this purpose, eight typical machine-learning models, i.e., symbolic regression (SR), XGBoost (XGB), CatBoost (CATB), random forest (RF), LightGBM, support vector regression (SVR), artificial neural networks (ANN), and Gaussian process regression (GPR) models, are selected and compared based on a database of 840 samples with 14 input features. The hyperparameter tuning of the introduced ML models is performed using the Bayesian optimization (BO) technique. The comparison results show that the CatBoost model is the most reliable and accurate ML model (R2 = 0.997 and 0.947 in the training and testing sets, respectively). In addition, simple and practical design expressions for RCDBs have been proposed based on the SR model with a physical meaning and acceptable accuracy (an average prediction-to-test ratio of 0.935 and a standard deviation of 0.198). Meanwhile, the shear strength predicted by ML models was then compared with classical mechanics-driven shear models, including two prominent practice codes (i.e., ACI318, EC2) and two previous mechanical models, which indicated that the ML approach is highly reliable and accurate over conventional methods. In addition, a reliability-based design was conducted on two ML models, and their reliability results were compared with those of two code standards. The findings revealed that the ML models demonstrate higher reliability compared to code standards.

  • Research Article
  • Cite Count Icon 173
  • 10.1016/j.cageo.2012.11.015
Monthly river flow forecasting using artificial neural network and support vector regression models coupled with wavelet transform
  • Nov 28, 2012
  • Computers & Geosciences
  • Aman Mohammad Kalteh

Monthly river flow forecasting using artificial neural network and support vector regression models coupled with wavelet transform

  • Research Article
  • Cite Count Icon 90
  • 10.3171/2013.1.jns121130
In-hospital mortality after traumatic brain injury surgery: a nationwide population-based comparison of mortality predictors used in artificial neural network and logistic regression models
  • Feb 1, 2013
  • Journal of Neurosurgery
  • Hon-Yi Shi + 3 more

Most reports compare artificial neural network (ANN) models and logistic regression models in only a single data set, and the essential issue of internal validity (reproducibility) of the models has not been adequately addressed. This study proposes to validate the use of the ANN model for predicting in-hospital mortality after traumatic brain injury (TBI) surgery and to compare the predictive accuracy of ANN with that of the logistic regression model. The authors of this study retrospectively analyzed 16,956 patients with TBI nationwide who were surgically treated in Taiwan between 1998 and 2009. For every 1000 pairs of ANN and logistic regression models, the area under the receiver operating characteristic curve (AUC), Hosmer-Lemeshow statistics, and accuracy rate were calculated and compared using paired t-tests. A global sensitivity analysis was also performed to assess the relative importance of input parameters in the ANN model and to rank the variables in order of importance. The ANN model outperformed the logistic regression model in terms of accuracy in 95.15% of cases, in terms of Hosmer-Lemeshow statistics in 43.68% of cases, and in terms of the AUC in 89.14% of cases. The global sensitivity analysis of in-hospital mortality also showed that the most influential (sensitive) parameters in the ANN model were surgeon volume followed by hospital volume, Charlson comorbidity index score, length of stay, sex, and age. This work supports the continued use of ANNs for predictive modeling of neurosurgery outcomes. However, further studies are needed to confirm the clinical efficacy of the proposed model.

  • Conference Article
  • Cite Count Icon 2
  • 10.1109/micai.2014.33
Modeling Synthesis Processes of Photocatalysts Using Symbolic Regression α-β
  • Nov 1, 2014
  • G Gonzalez-Campos + 3 more

Symbolic regression is an application of genetic programming and is used for modeling different dynamic processes. Industrial processes problems have been solved using this technique. In this work a symbolic regression algorithm is used for modeling the synthesis process of the oxides Bi2MoO6and V2O5 in order to provide a model. These oxides are used on heterogeneous photo catalysis. Genetic programming, artificial neural network and linear regression are compared with symbolic regression models using statistics metrics to evaluate them.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 14
  • 10.1038/s41598-024-53352-1
Prediction of the axial compression capacity of stub CFST columns using machine learning techniques
  • Feb 5, 2024
  • Scientific Reports
  • Khaled Megahed + 2 more

Concrete-filled steel tubular (CFST) columns have extensive applications in structural engineering due to their exceptional load-bearing capability and ductility. However, existing design code standards often yield different design capacities for the same column properties, introducing uncertainty for engineering designers. Moreover, conventional regression analysis fails to accurately predict the intricate relationship between column properties and compressive strength. To address these issues, this study proposes the use of two machine learning (ML) models—Gaussian process regression (GPR) and symbolic regression (SR). These models accept a variety of input variables, encompassing geometric and material properties of stub CFST columns, to estimate their strength. An experimental database of 1316 specimens was compiled from various research papers, including circular, rectangular, and double-skin stub CFST columns. In addition, a dimensionless output variable, referred to as the strength index, is introduced to enhance model performance. To validate the efficiency of the introduced models, predictions from these models are compared with those from two established standard codes and various ML algorithms, including support vector regression optimized with particle swarm optimization (PSVR), artificial neural networks, XGBoost (XGB), CatBoost (CATB), Random Forest, and LightGBM models. Through performance metrics, the CATB, GPR, PSVR and XGB models emerge as the most accurate and reliable models from the evaluation results. In addition, simple and practical design equations for the different types of CFST columns have been proposed based on the SR model. The developed ML models and proposed equations can predict the compressive strength of stub CFST columns with reliable and accurate results, making them valuable tools for structural engineering. Furthermore, the Shapley additive interpretation (SHAP) technique is employed for feature analysis. The results of the feature analysis reveal that section slenderness ratio and concrete strength parameters negatively impact the compressive strength index.

  • Research Article
  • 10.11591/ijai.v14.i1.pp286-297
Explainable machine learning models applied to predicting customer churn for e-commerce
  • Feb 1, 2025
  • IAES International Journal of Artificial Intelligence (IJ-AI)
  • Ikhlass Boukrouh + 1 more

<span lang="EN-US">Precise identification of customer churn is crucial for e-commerce companies due to the high costs associated with acquiring new customers. In this sector, where revenues are affected by customer churn, the challenge is intensified by the diversity of product choices offered on various marketplaces. Customers can easily switch from one platform to another, emphasizing the need for accurate churn classification to anticipate revenue fluctuations in <br /> e-commerce. In this context, this study proposes seven machine learning classification models to predict customer churn, including decision tree (DT), random forest (RF), support vector machine (SVM), logistic regression (LR), naïve Bayes (NB), k-nearest neighbors (K-NN), and artificial neural network (ANN). The performances of the models were evaluated using confusion matrix, accuracy, precision, recall, and F1-score. The results indicated that the ANN model achieves the highest accuracy at 92.09%, closely followed by RF at 91.21%. In contrast, the NB model performed the least favorably with an accuracy of 75.04%. Two explainable artificial intelligence (XAI) methods, shapley additive explanations (SHAP) and local interpretable model-agnostic explanations (LIME), were used to explain the models. SHAP provided global explanations for both ANN and RF models through Kernel SHAP and Tree SHAP. LIME, offering local explanations, was applied only to the ANN model which gave better accuracy.</span>

  • Research Article
  • Cite Count Icon 13
  • 10.1016/j.jaap.2024.106486
Machine learning prediction of bio-oil production from the pyrolysis of lignocellulosic biomass: Recent advances and future perspectives
  • Mar 30, 2024
  • Journal of Analytical and Applied Pyrolysis
  • Hyojin Lee + 2 more

Machine learning prediction of bio-oil production from the pyrolysis of lignocellulosic biomass: Recent advances and future perspectives

More from: Bioresource technology
  • New
  • Research Article
  • 10.1016/j.biortech.2025.133579
Engineering a living therapeutic: calcium phosphate-mineralized phycocyanobilin-producing E. coli as a self-contained photodynamic system.
  • Nov 6, 2025
  • Bioresource technology
  • Yifan Wei + 9 more

  • New
  • Research Article
  • 10.1016/j.biortech.2025.133625
Transcriptomic insights into TiO2 nanoparticle-assisted enhancement of growth, lipid production, and dark fermentation wastewater treatment in Chlorella sp. DT025.
  • Nov 6, 2025
  • Bioresource technology
  • Xiaolong Wu + 6 more

  • Research Article
  • 10.1016/j.biortech.2025.132878
Enhanced thioether formation in stormwater pipes induced by nitrogen-containing pollutants: The role of the sediment microbiome.
  • Nov 1, 2025
  • Bioresource technology
  • Fangyuan Jiang + 8 more

  • Research Article
  • 10.1016/j.biortech.2025.132867
Ensiling of hop chaff for biogas and fibre production using molasses, nitrate and lactic acid bacteria as ensiling agents.
  • Nov 1, 2025
  • Bioresource technology
  • Benedikt Hülsemann + 3 more

  • Research Article
  • 10.1016/j.biortech.2025.132993
Novel insights into pretreatment improving the cooperation of adhesive and planktonic microbes in anaerobic digestion.
  • Nov 1, 2025
  • Bioresource technology
  • Ligong Wang + 6 more

  • Research Article
  • 10.1016/j.biortech.2025.133601
Mechanism of biomass driving microbial community assembly in eco-engineering bauxite residues: Based on molecular traits of dissolved organic matter.
  • Nov 1, 2025
  • Bioresource technology
  • Guizhen Wang + 10 more

  • Research Article
  • 10.1016/s0960-8524(25)01061-2
Editorial Board
  • Nov 1, 2025
  • Bioresource Technology

  • Research Article
  • 10.1016/j.biortech.2025.133050
Environmental and operational stresses in PHA Biosynthesis: Mechanisms, Challenges, and Sustainable Solutions.
  • Nov 1, 2025
  • Bioresource technology
  • Lei Yang + 4 more

  • Research Article
  • 10.1016/j.biortech.2025.132922
Ultraviolet-shielding bioplastic films from Antarctic Klebsormidium sp. ASYA19: A sustainable approach to plastic alternatives.
  • Nov 1, 2025
  • Bioresource technology
  • Barış Ballık + 4 more

  • Research Article
  • 10.1016/j.biortech.2025.133021
Integrating methane conversion with multi-pathway nitrogen removal for wastewater in membrane-enabled stratified biofilm: System development and microbial dynamics.
  • Nov 1, 2025
  • Bioresource technology
  • Chun-Yan Yuan + 9 more

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.

Search IconWhat is the difference between bacteria and viruses?
Open In New Tab Icon
Search IconWhat is the function of the immune system?
Open In New Tab Icon
Search IconCan diabetes be passed down from one generation to the next?
Open In New Tab Icon