Investigating the impact of calibration on the quality of explanations

Helena Löfström, Ulf Johansson, Cecilia Sönströd, Tuwe Löfström

doi:10.1007/s10472-023-09837-2

Abstract

Predictive models used in Decision Support Systems (DSS) are often required to explain their reasoning to users. An explanation of an instance consists of two parts: the predicted label with an associated certainty, and a set of weights, one per feature, describing how each feature contributes to the prediction for that particular instance. In techniques like Local Interpretable Model-agnostic Explanations (LIME), the probability estimate from the underlying model is used as the measure of certainty; consequently, the feature weights represent how each feature contributes to the probability estimate. It is, however, well known that probability estimates from classifiers are often poorly calibrated, i.e., the probability estimates do not correspond to the actual probabilities of being correct. With this in mind, explanations from techniques like LIME risk becoming misleading, since the feature weights only describe how each feature contributes to a possibly inaccurate probability estimate. This paper investigates the impact of calibrating predictive models before applying LIME. The study includes 25 benchmark data sets, using random forest and Extreme Gradient Boosting (XGBoost) as learners and Venn-Abers and Platt scaling as calibration methods. Results from the study show that explanations of better calibrated models are themselves better calibrated: after calibration, the expected calibration error (ECE) and log loss of the explanations align more closely with the ECE and log loss of the model. The conclusion is that calibration improves both the models and their explanations by making them represent reality more accurately.
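The two calibration metrics named in the abstract, ECE and log loss, can be made concrete with a small sketch. This is an illustration only and not the authors' code: it assumes binary labels and equal-width confidence bins (the abstract does not state which ECE variant the study uses), the function names `expected_calibration_error` and `log_loss` are hypothetical, and the toy probabilities are invented for the example.

```python
import math


def expected_calibration_error(probs, labels, n_bins=10):
    """ECE for binary classification (assumed variant: equal-width bins).

    probs  : predicted probability of the positive class, one per instance
    labels : true labels (0 or 1)
    """
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        pred = 1 if p >= 0.5 else 0
        # Confidence is the probability assigned to the predicted label.
        conf = p if pred == 1 else 1.0 - p
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, 1.0 if pred == y else 0.0))

    n = len(probs)
    ece = 0.0
    for b in bins:
        if not b:
            continue
        avg_conf = sum(c for c, _ in b) / len(b)
        accuracy = sum(a for _, a in b) / len(b)
        # Weight each bin's confidence/accuracy gap by its share of instances.
        ece += (len(b) / n) * abs(accuracy - avg_conf)
    return ece


def log_loss(probs, labels, eps=1e-15):
    """Mean negative log-likelihood of the true labels."""
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(probs)


# Toy data (invented): the model is right 8 times out of 10.
labels = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
overconfident = [0.9] * 10   # claims 90% certainty
calibrated = [0.8] * 10      # claims 80% certainty, matching the accuracy

ece_raw = expected_calibration_error(overconfident, labels)
ece_cal = expected_calibration_error(calibrated, labels)
ll_raw = log_loss(overconfident, labels)
ll_cal = log_loss(calibrated, labels)
print(f"ECE raw={ece_raw:.3f} calibrated={ece_cal:.3f}")
print(f"log loss raw={ll_raw:.3f} calibrated={ll_cal:.3f}")
```

In this toy case the overconfident estimates have a larger gap between stated certainty and observed accuracy, so both metrics are lower for the calibrated estimates. The same two functions could, under the same assumptions, be applied to the certainty reported by an explanation to compare it against the model's own scores.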

Journal: Annals of Mathematics and Artificial Intelligence
Publication Date: Mar 13, 2023
Citations: 2
License type: CC BY 4.0

Similar Papers

Investigation on explainable machine learning models to predict chronic kidney diseases
Samit Kumar Ghosh ... Ahsan H Khandoker
Scientific Reports | VOL. 14
14 Feb 2024

Explainable artificial intelligence model for identifying COVID-19 gene biomarkers
Fatma Hilal Yagin ... Sami Akbulut
Computers in Biology and Medicine | VOL. 154
01 Feb 2023

Global and local interpretability techniques of supervised machine learning black box models for numerical medical data
Hajar Hakkoum ... Ibtissam Abnane
Engineering Applications of Artificial Intelligence | VOL. 131
09 Jan 2024

Prediction Model of Osteonecrosis of the Femoral Head After Femoral Neck Fracture: Machine Learning-Based Development and Validation Study.
Huan Wang ... Nan Xu
JMIR Medical Informatics | VOL. 9
19 Nov 2021
