Prediction of interface shear strength between ultra-high-performance concrete and concrete using machine learning method
Ultra-high-performance concrete (UHPC) bonded to normal concrete (NC) can significantly enhance the mechanical performance of UHPC–NC composite structures, and the interface shear strength is a crucial indicator for assessing the bonding performance. In this study, interpretable machine learning (ML) methods were used to analyse the effects of different parameters on interface shear strength. A database consisting of 305 UHPC–NC shear tests was created, and the isolation forest algorithm was applied to filter outliers. Subsequently, four ML models were trained to predict the interface shear strength of UHPC–NC composite structures. Among them, the extreme gradient boosting (XGBoost) model demonstrated the highest prediction accuracy, achieving an R2 value of 0.95. Shapley additive explanations (SHAP), partial dependence plots (PDP) and individual conditional expectation (ICE) were used for feature importance analysis, aiding in the interpretation of the ‘black box’ nature of the ML models. The results demonstrate that the normal compressive stress at the interface is the most influential factor affecting interfacial shear strength. Finally, a physically meaningful predictive equation for the interface shear strength of UHPC–NC composite structures was proposed based on the XGBoost model combined with curve fitting. This equation enhances the prediction accuracy of interface shear strength for UHPC–NC structures and offers deeper insights into the model’s decision making process.
- Research Article
2
- 10.3389/fcvm.2025.1444323
- Jan 24, 2025
- Frontiers in cardiovascular medicine
Early prediction of heart failure (HF) after acute myocardial infarction (AMI) is essential for personalized treatment. We aimed to use interpretable machine learning (ML) methods to develop a risk prediction model for HF in AMI patients. We retrospectively included patients initially with AMI who received percutaneous coronary intervention (PCI) in our hospital from November 2016 to February 2020. The primary endpoint was the occurrence of HF within 3 years after operation. For developing a predictive model for HF risk in AMI patients, the least absolute shrinkage and selection operator (LASSO) Regression was used to feature selection, and four ML algorithms including Random Forest (RF), Extreme Gradient Boost (XGBoost), Support Vector Machine (SVM), and Logistic Regression (LR) were employed to develop the model on the training set. The performance evaluation of the prediction model was carried out on the training set and the testing set, utilizing metrics including AUC (Area under the receiver operating characteristic curve), calibration plot, and decision curve analysis (DCA). In addition, we used the Shapley Additive Explanations (SHAP) value to determine the importance of the selected features and interpret the optimal model. A total of 1220 AMI patients were included and 244 (20%) patients developed HF during follow-up. Among the four evaluated ML models, the XGBoost model exhibited exceptional accuracy, with an AUC value of 0.922. The SHAP method showed that left ventricular ejection fraction (LVEF), left ventricular end-systolic diameter (LVDs) and lactate dehydrogenase (LDH) were identified as the three most important characteristics to predict HF risk in AMI patients. Individual risk assessment was performed using SHAP plots and waterfall plot analysis. Our research demonstrates the potential of ML methods in the early prediction of HF risk in AMI patients. Furthermore, it enhances the interpretability of the XGBoost model through SHAP analysis to guide clinical decision-making.
- Research Article
53
- 10.1371/journal.pone.0284315
- May 4, 2023
- PLOS ONE
Machine learning (ML) models are used in clinical metabolomics studies most notably for biomarker discoveries, to identify metabolites that discriminate between a case and control group. To improve understanding of the underlying biomedical problem and to bolster confidence in these discoveries, model interpretability is germane. In metabolomics, partial least square discriminant analysis (PLS-DA) and its variants are widely used, partly due to the model’s interpretability with the Variable Influence in Projection (VIP) scores, a global interpretable method. Herein, Tree-based Shapley Additive explanations (SHAP), an interpretable ML method grounded in game theory, was used to explain ML models with local explanation properties. In this study, ML experiments (binary classification) were conducted for three published metabolomics datasets using PLS-DA, random forests, gradient boosting, and extreme gradient boosting (XGBoost). Using one of the datasets, PLS-DA model was explained using VIP scores, while one of the best-performing models, a random forest model, was interpreted using Tree SHAP. The results show that SHAP has a more explanation depth than PLS-DA’s VIP, making it a powerful method for rationalizing machine learning predictions from metabolomics studies.
- Conference Article
- 10.15396/eres2021_104
- Jan 1, 2021
Machine Learning (ML) can detect complex relationships to solve problems in various research areas. To estimate real estate prices and rents, ML represents a promising extension to the hedonic literature since it is able to increase predictive accuracy and is more flexible than the standard regression-based hedonic approach in handling a variety of quantitative and qualitative inputs. Nevertheless, its inferential capacity is limited due to its complex non-parametric structure and the ‘black box’ nature of its operations. In recent years, research on Interpretable Machine Learning (IML) has emerged that improves the interpretability of ML applications. This paper aims to elucidate the analytical behaviour of ML methods and their predictions of residential rents applying a set of model-agnostic methods. Using a dataset of 58k apartment listings in Frankfurt am Main (Germany), we estimate rent levels with the eXtreme Gradient Boosting Algorithm (XGB). We then apply Permutation Feature Importance (PFI), Partial Dependence Plots (PDP), Individual Conditional Expectation Curve (ICE) and Accumulated Local Effects (ALE). Our results suggest that IML methods can provide valuable insights and yield higher interpretability of ‘black box’ models. According to the results of PFI, most relevant locational variables for apartments are the proximity to bars, convenience stores and bus station hubs. Feature effects show that ML identifies non-linear relationships between rent and proximity variables. Rental prices increase up to a distance of approx. 3 kilometer to a central bus hub, followed by steep decline. We therefore assume tenants to face a trade-off between good infrastructural accessibility and locational separation from the disamenities associated with traffic hubs such as noise and air pollution. The same holds true for proximity to bar with rents peaking at 1 km distance. While tenants appear to appreciate nearby nightlife facilities, immediate proximity is subject to rental discounts. In summary, IML methods can increase transparency of ML models and therefore identify important patterns in rental markets. This may lead to a better understanding of residential real estate and offer new insights for researchers as well as practitioners.
- Research Article
9
- 10.3389/fpubh.2022.1086339
- Jan 12, 2023
- Frontiers in Public Health
BackgroundRisk stratification of elderly patients with ischemic stroke (IS) who are admitted to the intensive care unit (ICU) remains a challenging task. This study aims to establish and validate predictive models that are based on novel machine learning (ML) algorithms for 28-day in-hospital mortality in elderly patients with IS who were admitted to the ICU.MethodsData of elderly patients with IS were extracted from the electronic intensive care unit (eICU) Collaborative Research Database (eICU-CRD) records of those elderly patients admitted between 2014 and 2015. All selected participants were randomly divided into two sets: a training set and a validation set in the ratio of 8:2. ML algorithms, such as Naïve Bayes (NB), eXtreme Gradient Boosting (xgboost), and logistic regression (LR), were applied for model construction utilizing 10-fold cross-validation. The performance of models was measured by the area under the receiver operating characteristic curve (AUC) analysis and accuracy. The present study uses interpretable ML methods to provide insight into the model's prediction and outcome using the SHapley Additive exPlanations (SHAP) method.ResultsAs regards the population demographics and clinical characteristics, the analysis in the present study included 1,236 elderly patients with IS in the ICU, of whom 164 (13.3%) died during hospitalization. As regards feature selection, a total of eight features were selected for model construction. In the training set, both the xgboost and NB models showed specificity values of 0.989 and 0.767, respectively. In the internal validation set, the xgboost model identified patients who died with an AUC value of 0.733 better than the LR model which identified patients who died with an AUC value of 0.627 or the NB model 0.672.ConclusionThe xgboost model shows the best predictive performance that predicts mortality in elderly patients with IS in the ICU. By making the ML model explainable, physicians would be able to understand better the reasoning behind the outcome.
- Research Article
46
- 10.1016/j.aap.2022.106617
- Feb 21, 2022
- Accident Analysis & Prevention
On the interpretability of machine learning methods in crash frequency modeling and crash modification factor development
- Research Article
3
- 10.1002/ehf2.14834
- May 15, 2024
- ESC Heart Failure
AimsIn recent years, there has been remarkable development in machine learning (ML) models, showing a trend towards high prediction performance. ML models with high prediction performance often become structurally complex and are frequently perceived as black boxes, hindering intuitive interpretation of the prediction results. We aimed to develop ML models with high prediction performance, interpretability, and superior risk stratification to predict in‐hospital mortality and worsening heart failure (WHF) in patients with acute heart failure (AHF).Methods and resultsBased on the Kyoto Congestive Heart Failure registry, which enrolled 4056 patients with AHF, we developed prediction models for in‐hospital mortality and WHF using information obtained on the first day of admission (demographics, physical examination, blood test results, etc.). After excluding 16 patients who died on the first or second day of admission, the original dataset (n = 4040) was split 4:1 into training (n = 3232) and test datasets (n = 808). Based on the training dataset, we developed three types of prediction models: (i) the classification and regression trees (CART) model; (ii) the random forest (RF) model; and (iii) the extreme gradient boosting (XGBoost) model. The performance of each model was evaluated using the test dataset, based on metrics including sensitivity, specificity, area under the receiver operating characteristic curve (AUC), Brier score, and calibration slope. For the complex structure of the XGBoost model, we performed SHapley Additive exPlanations (SHAP) analysis, classifying patients into interpretable clusters. In the original dataset, the proportion of females was 44.8% (1809/4040), and the average age was 77.9 ± 12.0. The in‐hospital mortality rate was 6.3% (255/4040) and the WHF rate was 22.3% (900/4040) in the total study population. In the in‐hospital mortality prediction, the AUC for the XGBoost model was 0.816 [95% confidence interval (CI): 0.815–0.818], surpassing the AUC values for the CART model (0.683, 95% CI: 0.680–0.685) and the RF model (0.755, 95% CI: 0.753–0.757). Similarly, in the WHF prediction, the AUC for the XGBoost model was 0.766 (95% CI: 0.765–0.768), outperforming the AUC values for the CART model (0.688, 95% CI: 0.686–0.689) and the RF model (0.713, 95% CI: 0.711–0.714). In the XGBoost model, interpretable clusters were formed, and the rates of in‐hospital mortality and WHF were similar among each cluster in both the training and test datasets.ConclusionsThe XGBoost models with SHAP analysis provide high prediction performance, interpretability, and reproducible risk stratification for in‐hospital mortality and WHF for patients with AHF.
- Research Article
1
- 10.1016/j.cscm.2024.e03840
- Oct 10, 2024
- Case Studies in Construction Materials
To satisfy the design strength of manufactured sand concrete (MSC) in practical engineering applications, a plethora of geotechnical tests are frequently conducted. An effective approach is imperative to reduce the consumption of labor and resources during these tests. The objective of this paper is to introduce an interpretable machine learning (ML) method to evaluate the compressive strength (CS) of MSC. Firstly, a dataset was established by compiling experimental results from 208 literatures. 3382 data points were selected from the dataset for algorithm training. Recursive Feature Elimination with Cross-Validation (RFECV) was employed to select input parameters. Four algorithms with 12 selected input variables and 1 output variable were constructed to predict CS of MSC using Random Forest (RF), Gradient Boosting Decision Trees (GBDT), eXtreme Gradient Boosting algorithm (XGBoost), and Categorical Boosting (CatBoost). The results show that XGBoost has the highest accuracy and generalization ability (R2=0.934, MAE=3.44, RMSE=5.16, MAPE=0.07). To enhance model transparency, SHapley Additive exPlanations (SHAP) was adopted to explain the underlying predictive mechanisms of ML models. Analyses show that, 1) the cement content, curing time, and water content were the main feature parameters influencing the CS of MSC, 2) the cement content, curing time, and water content have a linear increase, logarithmic increase, and exponential decrease with the CS of MSC, respectively. Partial Dependence Plots (PDP) and Individual Conditional Expectation (ICE) plots were used to further analyze the impacts of these significant influencing factors on the CS of MSC. Additionally, the Local Interpretable Model-Agnostic Explanations (LIME) method was employed to investigate thresholds for various material dosages in MSC containing 5–10 % stone powder. Two typical scenarios were selected for analysis, yielding recommended dosage ranges for concrete of two distinct strengths. Finally, a graphical user interface (GUI) for the CS of MSC has been designed, which might be of great use to material engineers. This provides reference and guidance for concrete engineering practice.
- Research Article
2
- 10.1080/00015385.2025.2481662
- Apr 7, 2025
- Acta Cardiologica
Background Predicting the prognosis of patients with acute myocardial infarction (AMI) combined with diabetes mellitus (DM) is crucial due to high in-hospital mortality rates. This study aims to develop and validate a mortality risk prediction model for these patients by interpretable machine learning (ML) methods. Methods Data were sourced from the Medical Information Mart for Intensive Care IV (MIMIC-IV, version 2.2). Predictors were selected by Least absolute shrinkage and selection operator (LASSO) regression and checked for multicollinearity with Spearman’s correlation. Patients were randomly assigned to training and validation sets in an 8:2 ratio. Seven ML algorithms were used to construct models in the training set. Model performance was evaluated in the validation set using metrics such as area under the curve (AUC) with 95% confidence interval (CI), calibration curves, precision, recall, F1 score, accuracy, negative predictive value (NPV), and positive predictive value (PPV). The significance of differences in predictive performance among models was assessed utilising the permutation test, and 10-fold cross-validation further validated the model’s performance. SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) were applied to interpret the models. Results The study included 2,828 patients with AMI combined with DM. Nineteen predictors were identified through LASSO regression and Spearman’s correlation. The Random Forest (RF) model was demonstrated the best performance, with an AUC of 0.823 (95% CI: 0.774–0.872), high precision (0.867), accuracy (0.873), and PPV (0.867). The RF model showed significant differences (p < 0.05) compared to the K-Nearest Neighbours and Decision Tree models. Calibration curves indicated that the RF model’s predicted risk aligned well with actual outcomes. 10-fold cross-validation confirmed the superior performance of RF model, with an average AUC of 0.828 (95% CI: 0.800–0.842). Significant Variables in RF model indicated that the top eight significant predictors were urine output, maximum anion gap, maximum urea nitrogen, age, minimum pH, maximum international normalised ratio (INR), mean respiratory rate, and mean systolic blood pressure. Conclusion This study demonstrates the potential of ML methods, particularly the RF model, in predicting in-hospital mortality risk for AMI patients with DM. The SHAP and LIME methods enhance the interpretability of ML models.
- Research Article
- 10.1002/bimj.70089
- Oct 30, 2025
- Biometrical Journal. Biometrische Zeitschrift
ABSTRACTWith the spread and rapid advancement of black box machine learning (ML) models, the field of interpretable machine learning (IML) or explainable artificial intelligence (XAI) has become increasingly important over the last decade. This is particularly relevant for survival analysis, where the adoption of IML techniques promotes transparency, accountability, and fairness in sensitive areas, such as clinical decision‐making processes, the development of targeted therapies, interventions, or in other medical or healthcare‐related contexts. More specifically, explainability can uncover a survival model's potential biases and limitations and provide more mathematically sound ways to understand how and which features are influential for prediction or constitute risk factors. However, the lack of readily available IML methods may have deterred practitioners from leveraging the full potential of ML for predicting time‐to‐event data. We present a comprehensive review of the existing work on IML methods for survival analysis within the context of the general IML taxonomy. In addition, we formally detail how commonly used IML methods, such as individual conditional expectation (ICE), partial dependence plots (PDP), accumulated local effects (ALE), different feature importance measures, or Friedman's H‐interaction statistics can be adapted to survival outcomes. An application of several IML methods to data on breast cancer recurrence in the German Breast Cancer Study Group (GBSG2) serves as a tutorial or guide for researchers, on how to utilize the techniques in practice to facilitate understanding of model decisions or predictions.
- Research Article
12
- 10.1080/0886022x.2023.2212790
- May 19, 2023
- Renal Failure
Background This study aimed to establish and validate a machine learning (ML) model for predicting in-hospital mortality in critically ill patients with chronic kidney disease (CKD). Methods This study collected data on CKD patients from 2008 to 2019 using the Medical Information Mart for Intensive Care IV. Six ML approaches were used to build the model. Accuracy and area under the curve (AUC) were used to choose the best model. In addition, the best model was interpreted using SHapley Additive exPlanations (SHAP) values. Results There were 8527 CKD patients eligible for participation; the median age was 75.1 (interquartile range: 65.0–83.5) years, and 61.7% (5259/8527) were male. We developed six ML models with clinical variables as input factors. Among the six models developed, the eXtreme Gradient Boosting (XGBoost) model had the highest AUC, at 0.860. According to the SHAP values, the sequential organ failure assessment score, urine output, respiratory rate, and simplified acute physiology score II were the four most influential variables in the XGBoost model. Conclusions In conclusion, we successfully developed and validated ML models for predicting mortality in critically ill patients with CKD. Among all ML models, the XGBoost model is the most effective ML model that can help clinicians accurately manage and implement early interventions, which may reduce mortality in critically ill CKD patients with a high risk of death.
- Research Article
- 10.1017/psy.2025.10032
- Jul 31, 2025
- Psychometrika
This study incorporates a random forest (RF) approach to probe complex interactions and nonlinearity among predictors into an item response model with the goal of using a hybrid approach to outperform either an RF or explanatory item response model (EIRM) only in explaining item responses. In the specified model, called EIRM-RF, predicted values using RF are added as a predictor in EIRM to model the nonlinear and interaction effects of person- and item-level predictors in person-by-item response data, while accounting for random effects over persons and items. The results of the EIRM-RF are probed with interpretable machine learning (ML) methods, including feature importance measures, partial dependence plots, accumulated local effect plots, and the H-statistic. The EIRM-RF and the interpretable methods are illustrated using an empirical data set to explain differences in reading comprehension in digital versus paper mediums, and the results of EIRM-RF are compared with those of EIRM and RF to show empirical differences in modeling the effects of predictors and random effects among EIRM, RF, and EIRM-RF. In addition, simulation studies are conducted to compare model accuracy among the three models and to evaluate the performance of interpretable ML methods.
- Research Article
8
- 10.3389/fimmu.2024.1367340
- May 1, 2024
- Frontiers in Immunology
The relationship between systemic inflammatory index (SII), sex steroid hormones, dietary antioxidants (DA), and gout has not been determined. We aim to develop a reliable and interpretable machine learning (ML) model that links SII, sex steroid hormones, and DA to gout identification. The dataset we used to study the relationship between SII, sex steroid hormones, DA, and gout was from the National Health and Nutrition Examination Survey (NHANES). Six ML models were developed to identify gout by SII, sex steroid hormones, and DA. The seven performance discriminative features of each model were summarized, and the eXtreme Gradient Boosting (XGBoost) model with the best overall performance was selected to identify gout. We used the SHapley Additive exPlanation (SHAP) method to explain the XGBoost model and its decision-making process. An initial survey of 20,146 participants resulted in 8,550 being included in the study. Selecting the best performing XGBoost model associated with SII, sex steroid hormones, and DA to identify gout (male: AUC: 0.795, 95% CI: 0.746- 0.843, accuracy: 98.7%; female: AUC: 0.822, 95% CI: 0.754- 0.883, accuracy: 99.2%). In the male group, The SHAP values showed that the lower feature values of lutein + zeaxanthin (LZ), vitamin C (VitC), lycopene, zinc, total testosterone (TT), vitamin E (VitE), and vitamin A (VitA), the greater the positive effect on the model output. In the female group, SHAP values showed that lower feature values of E2, zinc, lycopene, LZ, TT, and selenium had a greater positive effect on model output. The interpretable XGBoost model demonstrated accuracy, efficiency, and robustness in identifying associations between SII, sex steroid hormones, DA, and gout in participants. Decreased TT in males and decreased E2 in females may be associated with gout, and increased DA intake and decreased SII may reduce the potential risk of gout.
- Research Article
30
- 10.1016/j.conbuildmat.2023.133553
- Oct 2, 2023
- Construction and Building Materials
Shear strength prediction of FRP-strengthened concrete beams using interpretable machine learning
- Conference Article
5
- 10.2118/200019-ms
- Sep 24, 2020
Recently machine learning has being extensively deployed for oil and gas industry for improving result and expedite process. However, the black box models do not explain their prediction which considered as a barrier to adopt machine learning. This paper is about optimizing hydraulic fracture with machine learning methods and making informative decision with interpreting machine learning model. The solution can show that it could save over million dollars per well and improve well performance significantly. Interestingly, the machine leaning explainability approach was utilized to explain and measure the reason behind of why some wells are performing better than other and vice versa. Hydraulic fracturing modeling and optimization in tight oil and unconventional reservoir requires substantial geological modeling, fracture design, post-fracture production simulation with excessive sensitivity analysis due to complexity and uncertainty in the nature of data. These types of studies are computationally and monetarily expensive. Furthermore, digital oil technology has facilitated the process of data gathering enabled operators to have access to huge amount of data. Common approaches are no longer suitable to handle this pile of data but machine learning methods could be successfully utilized for this purpose. In this paper, a variety types of advanced machine learning methods including linear regression, Random forest, Gradient Boost, XGBoost, Bagging, ExtraTrees and neural network were employed to optimize well completion in Montney formation. The objective was to create a robust predictive model capturing all the effective operational well parameters (features) capable of optimizing the first 12 months cumulative of equivalent well production. Special Individual Conditional Expectation (ICE) plots and Partial Dependency plots(PDP) were used to depict how HF completion features influence the prediction of a machine learning model. Furthermore, a novel approach was employed to explain the model prediction of an existing well by computing the contribution of each feature to the prediction. Over 1838 hydraulically fractured (HF) wells producing from 2008 till 2019 in Montney formation have been considered for this analysis. The outcome of Explanatory Data Analysis (EDA) revealed that well production performance has not been improved despite of continues enhancement of hydraulic fracture parameters such as proppant injected volume, length of stimulated horizontal wells, and number of stages per well in the course of two years. This finding raises the concern of whether operators are properly optimizing completion design. After comparing all machine learning methods, Random Forest method was chosen as the most appropriate and accurate method to proceed for further analysis. ICE and PDP plots helped to understand the impacts of different fracturing features on production for individual well in addition to define optimum operation features on Montney Formation. Furthermore, quantifying of each feature’s impact on individual well production and linking it to an economic model, we were able to demonstrate potential profit and loss for each well. The model suggests that some wells could have achieved over $1 million extra profit during the first 12-months of production. In this study, not only a reliable predictive data-driven model has been built for hydraulically-fractured wells in Montney formation, but also a comprehensive workflow of sensitivity and explainatability analysis has been introduced to obtain an optimized fit-to-purpose well completion design.
- Research Article
- 10.1007/s00704-025-05703-9
- Aug 26, 2025
- Theoretical and Applied Climatology
Climate signals, driven by complex interactions and nonlinear relationships, shape weather patterns and long-term trends, complicating the identification of dominant drivers due to collinearity. This study investigates the consistency and uncertainty of machine learning (ML) techniques for feature importance in climate science, comparing SHapley Additive exPlanations (SHAP), Partial Dependence Plots (PDPs), and gain-based feature importance from Extreme Gradient Boosting (XGBoost). SHAP’s integration with Feed Forward Neural Networks (FFNN) and XGBoost is evaluated to assess model-specific uncertainties. Using winter precipitation data from Ohio, USA, as a case study, the relative contributions of global warming (GW) and the Interdecadal Pacific Oscillation (IPO) to precipitation changes are quantified. Results show GW consistently ranks higher than IPO in at least 60% of stations across all methods, with SHAP and PDPs agreeing in 89% of stations. Global SHAP importance from FFNN and XGBoost aligns in 82% of stations, with GW contributing 15% more than IPO on average, though disagreements in 18% of stations highlight model-dependent uncertainties. Temporal analysis using SHAP values indicates a moderate discrepancy in feature importance between FFNN and XGBoost models (Pearson correlation ≈ 0.5), despite their consensus on the increasing dominance of GW in recent decades, contributing to wetter winters. Regression analysis further confirms that GW accounts for approximately 70% of the multi-decadal variability in winter precipitation across Ohio, with PDPs indicating a strong monotonicity (ρ = 0.94) between warming levels and precipitation increase. PDPs visualize marginal effects but struggle with interactions, while gain-based methods tend to favor features with a greater number of effective split points that reduce loss. SHAP, though robust for ranking, varies with the base model. An ensemble framework is proposed, demonstrating the value of combining these ML techniques complementarily to account for uncertainties and enhance interpretability. This study highlights the importance of addressing methodological uncertainties in feature importance rankings to provide robust insights for climate modeling.
- Ask R Discovery
- Chat PDF
AI summaries and top papers from 250M+ research sources.