An explainable machine learning approach (SHAP) to assessing desertification risk and its drivers in the Ring-Tarim Basin, 1990–2020

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

An explainable machine learning approach (SHAP) to assessing desertification risk and its drivers in the Ring-Tarim Basin, 1990–2020

Similar Papers
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 32
  • 10.3389/fmed.2021.663739
Explainable Machine Learning to Predict Successful Weaning Among Patients Requiring Prolonged Mechanical Ventilation: A Retrospective Cohort Study in Central Taiwan.
  • Apr 23, 2021
  • Frontiers in Medicine
  • Ming-Yen Lin + 6 more

Objective: The number of patients requiring prolonged mechanical ventilation (PMV) is increasing worldwide, but the weaning outcome prediction model in these patients is still lacking. We hence aimed to develop an explainable machine learning (ML) model to predict successful weaning in patients requiring PMV using a real-world dataset.Methods: This retrospective study used the electronic medical records of patients admitted to a 12-bed respiratory care center in central Taiwan between 2013 and 2018. We used three ML models, namely, extreme gradient boosting (XGBoost), random forest (RF), and logistic regression (LR), to establish the prediction model. We further illustrated the feature importance categorized by clinical domains and provided visualized interpretation by using SHapley Additive exPlanations (SHAP) as well as local interpretable model-agnostic explanations (LIME).Results: The dataset contained data of 963 patients requiring PMV, and 56.0% (539/963) of them were successfully weaned from mechanical ventilation. The XGBoost model (area under the curve [AUC]: 0.908; 95% confidence interval [CI] 0.864–0.943) and RF model (AUC: 0.888; 95% CI 0.844–0.934) outperformed the LR model (AUC: 0.762; 95% CI 0.687–0.830) in predicting successful weaning in patients requiring PMV. To give the physician an intuitive understanding of the model, we stratified the feature importance by clinical domains. The cumulative feature importance in the ventilation domain, fluid domain, physiology domain, and laboratory data domain was 0.310, 0.201, 0.265, and 0.182, respectively. We further used the SHAP plot and partial dependence plot to illustrate associations between features and the weaning outcome at the feature level. Moreover, we used LIME plots to illustrate the prediction model at the individual level. Additionally, we addressed the weekly performance of the three ML models and found that the accuracy of XGBoost/RF was ~0.7 between weeks 4 and week 7 and slightly declined to 0.6 on weeks 8 and 9.Conclusion: We used an ML approach, mainly XGBoost, SHAP plot, and LIME plot to establish an explainable weaning prediction ML model in patients requiring PMV. We believe these approaches should largely mitigate the concern of the black-box issue of artificial intelligence, and future studies are warranted for the landing of the proposed model.

  • Research Article
  • Cite Count Icon 13
  • 10.1016/j.cej.2022.138036
An improved machine learning approach for predicting granular flows
  • Jul 12, 2022
  • Chemical Engineering Journal
  • Dan Xu + 1 more

An improved machine learning approach for predicting granular flows

  • Research Article
  • Cite Count Icon 16
  • 10.1016/j.conbuildmat.2023.130321
Optimized machine learning approaches for identifying vertical temperature gradient on ballastless track in natural environments
  • Jan 16, 2023
  • Construction and Building Materials
  • Tao Shi + 1 more

Optimized machine learning approaches for identifying vertical temperature gradient on ballastless track in natural environments

  • Research Article
  • Cite Count Icon 4
  • 10.1016/j.jval.2024.12.010
Do Machine Learning Approaches Perform Better Than Regression Models in Mapping Studies? A Systematic Review.
  • May 1, 2025
  • Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research
  • Tianqi Hong + 4 more

Do Machine Learning Approaches Perform Better Than Regression Models in Mapping Studies? A Systematic Review.

  • Research Article
  • 10.52783/jisem.v10i24s.3914
Explainable Machine Learning on Health Management Information System Data to Unveil Health Factors Affecting Maternal Mortality Ratio of Districts in India towards Achieving Sustainable Development Goals
  • Mar 24, 2025
  • Journal of Information Systems Engineering and Management
  • Siva Kumar Saragadam

Purpose: Maternal mortality remained to persist in many developing countries. India being the most populous developing country has several cultural differences and beliefs on health systems and may have disparities in receiving proper maternal health care. High data availability and under-utilization of data centric decision making is a key reason for ineffective performance of health systems. Enacting machine learning on such data to aid in sub-divisional policy formulation to address area level problems will eradicate major disparities in recipients of health services. Methods: Glass box machine learning models are trained on the data to obtain importance of features in defining the maternal mortality of a district. Furthermore, black box machine learning models are trained with hyper-parameter tuning and best model is chosen to perform explainable machine learning to generate explanations for each district prediction. A hybrid explainable machine learning approach is proposed on black-box machine learning models where Shapley Additive Explanation and Local Interpretable Model-agnostic Explanations are combined to generate final explanations. Results: There may be several differences even among nearby districts. Health Management Information System data is analyzed with help of Machine Learning techniques and Explainable Machine Learning techniques are used on the trained models to evaluate the contributing factors for each district. Conclusion: The factors that are specific to each district can help in formulating region specific health policies that minimize the disparities of progress of preventing maternal mortality over the districts of India. The paper has highlighted the advantages of using explainable machine learning in extracting complicated patterns of the data.

  • Research Article
  • 10.1016/j.chemosphere.2025.144777
Identifying soil drivers of rice productivity under fly ash and organic amendments using explainable machine learning.
  • Dec 1, 2025
  • Chemosphere
  • Soumyajeet Pradhan + 8 more

Identifying soil drivers of rice productivity under fly ash and organic amendments using explainable machine learning.

  • Research Article
  • Cite Count Icon 1
  • 10.1016/j.ifacol.2024.09.085
Efficient Milling Quality Prediction with Explainable Machine Learning
  • Jan 1, 2024
  • IFAC PapersOnLine
  • Dennis Gross + 4 more

Efficient Milling Quality Prediction with Explainable Machine Learning

  • Research Article
  • 10.3389/fneur.2025.1687144
Application of machine learning approaches to predict seizure-onset zones in patients with drug-resistant epilepsy: a systematic review
  • Jan 1, 2025
  • Frontiers in Neurology
  • Ali Haider Bangash + 4 more

Machine learning (ML) approaches have emerged as promising tools for improving seizure-onset zone (SOZ) prediction in patients with drug-resistant epilepsy (DRE). This systematic review aimed to evaluate the application and performance of ML approaches for SOZ prediction in patients with DRE. A comprehensive search was conducted across PubMed/MEDLINE, the Cochrane Database of Systematic Reviews, and Epistemonikos databases for studies employing ML algorithms for SOZ prediction in patients with DRE. The Quality Assessment of Diagnostic Accuracy Studies version 2 (QUADAS-2) tool was adopted to assess the methodological quality and risk of bias of included studies. Data on patient demographics, data acquisition methods, ML algorithms, and performance metrics were extracted and systematically synthesized. Out of a total of 38 studies, 15 studies met the inclusion criteria, encompassing 352 patients (mean age: 28 years, 34% female population). The studies employed various ML techniques, including traditional methods such as support vector machines and advanced deep learning architectures. Performance metrics varied widely across studies, with some approaches achieving accuracy, sensitivity, and specificity values above 90%. Deep learning models generally outperformed traditional methods, particularly in handling complex, multimodal data. Notably, personalized models demonstrated superior performance in reducing localization error and spatial dispersion. However, heterogeneity in data acquisition methods, patient populations, and reporting standards complicated direct comparisons between studies. This review highlighted the potential of ML approaches, particularly deep learning and personalized models, to enhance SOZ prediction accuracy in patients with DRE. However, several challenges were identified, including the need for standardized data collection protocols, larger prospective studies, and improved model interpretability. The findings underscore the importance of considering network-level changes in epilepsy when developing ML models for SOZ prediction. Although ML approaches show promise for improving surgical planning and outcomes in DRE, their clinical utility, particularly in complex epilepsy cases, requires further investigation. Addressing these challenges will be crucial in realizing the full potential of ML in enhancing epilepsy care.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 10
  • 10.1186/s12871-022-01888-y
Explainable machine learning approach to predict extubation in critically ill ventilated patients: a retrospective study in central Taiwan
  • Nov 14, 2022
  • BMC Anesthesiology
  • Kai-Chih Pai + 4 more

BackgroundWeaning from mechanical ventilation (MV) is an essential issue in critically ill patients, and we used an explainable machine learning (ML) approach to establish an extubation prediction model.MethodsWe enrolled patients who were admitted to intensive care units during 2015–2019 at Taichung Veterans General Hospital, a referral hospital in central Taiwan. We used five ML models, including extreme gradient boosting (XGBoost), categorical boosting (CatBoost), light gradient boosting machine (LightGBM), random forest (RF) and logistic regression (LR), to establish the extubation prediction model, and the feature window as well as prediction window was 48 h and 24 h, respectively. We further employed feature importance, Shapley additive explanations (SHAP) plot, partial dependence plot (PDP) and local interpretable model-agnostic explanations (LIME) for interpretation of the model at the domain, feature, and individual levels.ResultsWe enrolled 5,940 patients and found the accuracy was comparable among XGBoost, LightGBM, CatBoost and RF, with the area under the receiver operating characteristic curve using XGBoost to predict extubation was 0.921. The calibration and decision curve analysis showed well applicability of models. We also used the SHAP summary plot and PDP plot to demonstrate discriminative points of six key features in predicting extubation. Moreover, we employed LIME and SHAP force plots to show predicted probabilities of extubation and the rationale of the prediction at the individual level.ConclusionsWe developed an extubation prediction model with high accuracy and visualised explanations aligned with clinical workflow, and the model may serve as an autonomous screen tool for timely weaning.

  • Preprint Article
  • Cite Count Icon 1
  • 10.5194/egusphere-egu2020-690
Are Machine Learning methods robust enough for hydrological modeling under changing conditions?
  • Jul 17, 2020
  • Carolina Natel De Moura + 3 more

<p>The advancement of big data and increased computational power have contributed to an increased use of Machine Learning (ML) approaches in hydrological modelling. These approaches are powerful tools for modeling non-linear systems. However, the applicability of ML in non-stationary conditions needs to be studied further. As climate change will change hydrological patterns, testing ML approaches for non-stationary conditions is essential. Here, we used the Differential Split-Sample Test (DSST) to test the climate transposability of ML approaches (e.g., calibrating in a wet period and validating in a dry one, and vice-versa).  We applied five ML approaches using daily precipitation and temperature as input for the prediction of the daily discharge in six snow-dominated Swiss catchments. Lower and upper benchmarks were used to evaluate performances through a relative performance measure. The lower benchmark is the average of the bucket-type HBV model runs from 1000 random parameter sets. The upper benchmark is the automatically calibrated HBV model. In comparison with the stationary condition, the models performed slightly poorer in the non-stationary condition. The performance of simple ML approaches was poor for non-stationary conditions with an underestimation of peak flows, as well as a poor representation of the snow-melting period. On the other hand, a more complex ML approach (deep learning), the Long Short -Term Memory (LSTM), showed a good performance when compared with the lower and upper benchmarks. This might be explained by the fact that the so-called memory cell allowed to simulate the storage effects. </p>

  • Supplementary Content
  • Cite Count Icon 86
  • 10.2174/1573405613666170428154156
A Review of Denoising Medical Images Using Machine Learning Approaches
  • Oct 1, 2018
  • Current Medical Imaging Reviews
  • Prabhpreet Kaur + 2 more

Background: This paper attempts to identify suitable Machine Learning (ML) approach for image denoising of radiology based medical application. The Identification of ML approach is based on (i) Review of ML approach for denoising (ii) Review of suitable Medical Denoising approach.Discussion: The review focuses on six application of radiology: Medical Ultrasound (US) for fetus development, US Computer Aided Diagnosis (CAD) and detection for breast, skin lesions, brain tumor MRI diagnosis, X-Ray for chest analysis, Breast cancer using MRI imaging. This survey identifies the ML approach with better accuracy for medical diagnosis by radiologists. The image denoising approaches further includes basic filtering techniques, wavelet medical denoising, curvelet and optimization techniques. In most of the applications, the machine learning performance is better than the conventional image denoising techniques. For fast and computational results the radiologists are using the machine learning methods on MRI, US, X-Ray and Skin lesion images. The characteristics and contributions of different ML approaches are considered in this paper.Conclusion: The problem faced by the researchers during image denoising techniques and machine learning applications for clinical settings have also been discussed.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 138
  • 10.3390/fire2030043
Spatial Prediction of Wildfire Susceptibility Using Field Survey GPS Data and Machine Learning Approaches
  • Jul 28, 2019
  • Fire
  • Omid Ghorbanzadeh + 6 more

Recently, global climate change discussions have become more prominent, and forests are considered as the ecosystems most at risk by the consequences of climate change. Wildfires are among one of the main drivers leading to losses in forested areas. The increasing availability of free remotely sensed data has enabled the precise locations of wildfires to be reliably monitored. A wildfire data inventory was created by integrating global positioning system (GPS) polygons with data collected from the moderate resolution imaging spectroradiometer (MODIS) thermal anomalies product between 2012 and 2017 for Amol County, northern Iran. The GPS polygon dataset from the state wildlife organization was gathered through extensive field surveys. The integrated inventory dataset, along with sixteen conditioning factors (topographic, meteorological, vegetation, anthropological, and hydrological factors), was used to evaluate the potential of different machine learning (ML) approaches for the spatial prediction of wildfire susceptibility. The applied ML approaches included an artificial neural network (ANN), support vector machines (SVM), and random forest (RF). All ML approaches were trained using 75% of the wildfire inventory dataset and tested using the remaining 25% of the dataset in the four-fold cross-validation (CV) procedure. The CV method is used for dealing with the randomness effects of the training and testing dataset selection on the performance of applied ML approaches. To validate the resulting wildfire susceptibility maps based on three different ML approaches and four different folds of inventory datasets, the true positive and false positive rates were calculated. In the following, the accuracy of each of the twelve resulting maps was assessed through the receiver operating characteristics (ROC) curve. The resulting CV accuracies were 74%, 79% and 88% for the ANN, SVM and RF, respectively.

  • Research Article
  • Cite Count Icon 11
  • 10.1016/j.measen.2023.100925
A comprehensive comparison of machine learning approaches with hyper-parameter tuning for smartphone sensor-based human activity recognition
  • Oct 17, 2023
  • Measurement: Sensors
  • Vasundhara Ghate + 1 more

A comprehensive comparison of machine learning approaches with hyper-parameter tuning for smartphone sensor-based human activity recognition

  • Research Article
  • Cite Count Icon 10
  • 10.1124/jpet.122.001551
Quantitative Systems Pharmacology and Machine Learning: A Match Made in Heaven or Hell?
  • Aug 31, 2023
  • The Journal of pharmacology and experimental therapeutics
  • Marcus John Tindall + 3 more

As pharmaceutical development moves from early-stage in vitro experimentation to later in vivo and subsequent clinical trials, data and knowledge are acquired across multiple time and length scales, from the subcellular to whole patient cohort scale. Realizing the potential of this data for informing decision making in pharmaceutical development requires the individual and combined application of machine learning (ML) and mechanistic multiscale mathematical modeling approaches. Here we outline how these two approaches, both individually and in tandem, can be applied at different stages of the drug discovery and development pipeline to inform decision making compound development. The importance of discerning between knowledge and data are highlighted in informing the initial use of ML or mechanistic quantitative systems pharmacology (QSP) models. We discuss the application of sensitivity and structural identifiability analyses of QSP models in informing future experimental studies to which ML may be applied, as well as how ML approaches can be used to inform mechanistic model development. Relevant literature studies are highlighted and we close by discussing caveats regarding the application of each approach in an age of constant data acquisition. SIGNIFICANCE STATEMENT: We consider when best to apply machine learning (ML) and mechanistic quantitative systems pharmacology (QSP) approaches in the context of the drug discovery and development pipeline. We discuss the importance of prior knowledge and data available for the system of interest and how this informs the individual and combined application of ML and QSP approaches at each stage of the pipeline.

  • Research Article
  • Cite Count Icon 47
  • 10.1002/aps3.11371
Plants meet machines: Prospects in machine learning for plant biology
  • Jun 1, 2020
  • Applications in Plant Sciences
  • Pamela S Soltis + 3 more

Plants meet machines: Prospects in machine learning for plant biology

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.