Interpretability in healthcare: A comparative study of local machine learning interpretability techniques

Radwa Elshawi,Mouaz Al‐Mallah,Youssef Sherif,Sherif Sakr

doi:10.1111/coin.12410

Abstract

AbstractAlthough complex machine learning models (eg, random forest, neural networks) are commonly outperforming the traditional and simple interpretable models (eg, linear regression, decision tree), in the healthcare domain, clinicians find it hard to understand and trust these complex models due to the lack of intuition and explanation of their predictions. With the new general data protection regulation (GDPR), the importance for plausibility and verifiability of the predictions made by machine learning models has become essential. Hence, interpretability techniques for machine learning models are an area focus of research. In general, the main aim of these interpretability techniques is to shed light and provide insights into the prediction process of the machine learning models and to be able to explain how the results from the prediction was generated. A major problem in this context is that both the quality of the interpretability techniques and trust of the machine learning model predictions are challenging to measure. In this article, we propose four fundamental quantitative measures for assessing the quality of interpretability techniques—similarity,bias detection,execution time, andtrust. We present a comprehensive experimental evaluation of six recent and popularlocalmodel agnostic interpretability techniques, namely,LIME,SHAP,Anchors,LORE,ILIME“ andMAPLEon different types of real‐world healthcare data. Building on previous work, our experimental evaluation covers different aspects for its comparison includingidentity,stability,separability,similarity,execution time,bias detection, andtrust. The results of our experiments show that MAPLE achieves the highest performance for the identity across all data sets included in this study, while LIME achieves the lowest performance for the identity metric. LIME achieves the highest performance for the separability metric across all data sets. On average, SHAP has the smallest average time to output explanation across all data sets included in this study. For detecting the bias, SHAP and MAPLE enable the participants to better detect the bias. For the trust metric, Anchors achieves the highest performance on all data sets included in this work.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Interpretability in healthcare: A comparative study of local machine learning interpretability techniques

Abstract

Talk to us

Similar Papers

More From: Computational Intelligence

Lead the way for us

Journal: Computational Intelligence	Publication Date: Nov 24, 2020
Citations: 76

Similar Papers

Interpretability in HealthCare A Comparative Study of Local Machine Learning Interpretability Techniques
Radwa El Shawi ... Mouaz Al-Mallah
-
Radwa El Shawi, et. al.Radwa El Shawi ... Mouaz Al-Mallah
01 Jun 2019
01 Jun 2019

On the interpretability of the SVM model for predicting infant mortality in Bangladesh
Md Abu Sayeed ... Rumana Rois
Journal of Health, Population and Nutrition | VOL. 43
Md Abu Sayeed, et. al.Md Abu Sayeed ... Rumana Rois
27 Oct 2024
Journal of Health, Population and Nutrition | VOL. 43

On the interpretability of machine learning-based model for predicting hypertension
Radwa Elshawi ... Sherif Sakr
BMC Medical Informatics and Decision Making | VOL. 19
Radwa Elshawi, et. al.Radwa Elshawi ... Sherif Sakr
29 Jul 2019
BMC Medical Informatics and Decision Making | VOL. 19

Performance evaluation of ML models for preoperative prediction of HER2-low BC based on CE-CBBCT radiomic features: A prospective study.
Xianfei Chen ... Xueli Liang
Medicine | VOL. 103
Xianfei Chen, et. al.Xianfei Chen ... Xueli Liang
14 Jun 2024
Medicine | VOL. 103

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Interpretability in healthcare: A comparative study of local machine learning interpretability techniques

Abstract

Talk to us

Similar Papers

More From: Computational Intelligence