Abstract

Explainable artificial intelligence is an emerging research direction that helps the users or developers of machine learning models understand why models behave the way they do. The most popular explanation technique is feature importance. However, there are several different approaches to how feature importance is measured, most notably global and local. In this study, we compare different feature importance measures using both linear (logistic regression with L1 penalization) and non-linear (random forest) methods, with local interpretable model-agnostic explanations (LIME) applied on top of them. These methods are applied to two datasets from the medical domain: the openly available breast cancer data from the UCI Archive and a recently collected running injury dataset. Our results show that the most important features differ depending on the technique. We argue that a combination of several explanation techniques could provide more reliable and trustworthy results. In particular, local explanations should be used in the most critical cases, such as false negatives.
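The comparison described above can be sketched in a few lines of scikit-learn and LIME code. The snippet below is a minimal illustration, not the authors' exact pipeline: it fits an L1-penalized logistic regression and a random forest on the UCI breast cancer data, extracts their global feature importances, and asks LIME for a local explanation of a single test instance. The hyperparameters, the train/test split, the top-9 cutoff, and the choice of instance to explain are placeholder assumptions.

```python
# Minimal sketch (not the study's exact pipeline): global importances from an
# L1-penalized logistic regression and a random forest, plus a LIME local
# explanation, on the UCI breast cancer data. All settings are assumptions.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from lime.lime_tabular import LimeTabularExplainer

data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.3, random_state=0, stratify=data.target)

# Global importance 1: coefficients of a sparse (L1) logistic regression.
logreg = make_pipeline(StandardScaler(),
                       LogisticRegression(penalty="l1", solver="liblinear", C=1.0))
logreg.fit(X_train, y_train)
coefs = logreg.named_steps["logisticregression"].coef_.ravel()

# Global importance 2: impurity-based importances of a random forest.
forest = RandomForestClassifier(n_estimators=500, random_state=0)
forest.fit(X_train, y_train)

def top_features(scores, k=9):
    """Return the k feature names with the largest absolute score."""
    order = np.argsort(np.abs(scores))[::-1][:k]
    return [data.feature_names[i] for i in order]

print("L1 logistic regression:", top_features(coefs))
print("Random forest:         ", top_features(forest.feature_importances_))

# Local explanation: LIME for one test case (e.g., a suspected false negative).
explainer = LimeTabularExplainer(X_train,
                                 feature_names=list(data.feature_names),
                                 class_names=list(data.target_names),
                                 mode="classification")
exp = explainer.explain_instance(X_test[0], forest.predict_proba, num_features=9)
print(exp.as_list())
```

Even in a toy sketch like this, the two global rankings can disagree, which is the kind of discrepancy the abstract refers to; the LIME output then shows which features drove one particular prediction.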

Highlights

  • Classification models have two main objectives [9]

  • We focus on feature importance or saliency techniques, that is, techniques that explain the decision of an algorithm by assigning values that reflect the importance of input components in their contribution to that decision [36]

  • Bolded are the nine features detected with random forest and the nine most important features with regression, ranked based on the p-value

Introduction

Classification models have two main objectives [9]. First, they should perform well, meaning they should forecast the output for new given input features as accurately as possible. Second, they should be interpretable. Simple linear classification models are easy to understand and interpret but typically perform worse than non-linear models [10, 19, 24, 44], while complex prediction models with non-linear combinations of features tend to perform better (e.g., [32, 33, 41]) but are less interpretable. In other words, they often do a better job of classifying new instances correctly, but the reasons why a certain classification was made are hidden. These models often do not provide enough insight into the classification, which would be needed to employ them in sensitive domains.
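One way to make the interpretability contrast concrete: for a logistic regression, the predicted log-odds decompose additively into per-feature terms, so each feature's contribution to a specific decision can be read off directly, whereas a random forest offers no such closed-form decomposition. The snippet below is a self-contained illustrative sketch; the dataset choice and the model settings are assumptions for illustration only, not the study's configuration.

```python
# Illustrative sketch: why a linear classifier is directly interpretable.
# For logistic regression, the log-odds are a sum of per-feature terms
# coef_j * x_j plus an intercept, so each feature's role in one decision
# can be read off. Dataset and settings here are illustrative assumptions.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

data = load_breast_cancer()
X = StandardScaler().fit_transform(data.data)
model = LogisticRegression(penalty="l1", solver="liblinear").fit(X, data.target)

x = X[0]                              # one instance to explain
contrib = model.coef_.ravel() * x     # per-feature log-odds contributions
log_odds = contrib.sum() + model.intercept_[0]

# Five largest contributions (by absolute value) for this instance.
for name, c in sorted(zip(data.feature_names, contrib), key=lambda t: -abs(t[1]))[:5]:
    print(f"{name:>25s}: {c:+.3f}")
# In this dataset, class 1 corresponds to 'benign'.
print(f"total log-odds = {log_odds:+.3f}, P(benign) = {1 / (1 + np.exp(-log_odds)):.3f}")
```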
