DeforestVis: Behaviour Analysis of Machine Learning Models with Surrogate Decision Stumps

Angelos Chatzimparmpas, Rafael M. Martins, Andreas Kerren, Alexandru C. Telea

doi:10.1111/cgf.15004

Open Access

https://doi.org/10.1111/cgf.15004

Journal: Computer Graphics Forum	Publication Date: Feb 27, 2024
Citations: 1	License type: CC BY 4.0

Affiliation: Linnaeus University

Abstract

As the complexity of machine learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model-agnostic way to interpret such models is to train surrogate models (such as rule sets and decision trees) that sufficiently approximate the original ones while being simpler and easier to explain. Yet rule sets can become very lengthy, with many if-else statements, and decision tree depth grows rapidly when accurately emulating complex ML models. In such cases, both approaches can fail to meet their core goal: providing users with model interpretability. To tackle this, we propose DeforestVis, a visual analytics tool that summarizes the behaviour of complex ML models by providing surrogate decision stumps (one-level decision trees) generated with the Adaptive Boosting (AdaBoost) technique. DeforestVis helps users to explore the complexity versus fidelity trade-off by incrementally generating more stumps, creating attribute-based explanations with weighted stumps to justify decision making, and analysing the impact of rule overriding on training instance allocation between one or more stumps. An independent test set allows users to monitor the effectiveness of manual rule changes and form hypotheses based on case-by-case analyses. We show the applicability and usefulness of DeforestVis with two use cases and expert interviews with data analysts and model developers.
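The complexity versus fidelity trade-off described in the abstract can be illustrated with a minimal sketch. This is not the DeforestVis implementation: it assumes a textbook discrete AdaBoost over exhaustively searched decision stumps, and the `black_box` function, the synthetic two-feature data, and all helper names are hypothetical stand-ins. Fidelity here means the fraction of instances on which the stump ensemble agrees with the black-box model's predictions.

```python
import math
import random

def black_box(x):
    # Hypothetical stand-in for a complex model (not from the paper).
    return 1 if x[0] ** 2 + x[1] > 0.5 else -1

def stump_predict(stump, x):
    # A stump is a one-level tree: (feature index, threshold, polarity).
    feature, threshold, polarity = stump
    return polarity if x[feature] > threshold else -polarity

def fit_stump(X, y, w):
    # Exhaustive search for the stump with the lowest weighted error.
    best, best_err = None, float("inf")
    for feature in range(len(X[0])):
        for threshold in sorted({x[feature] for x in X}):
            for polarity in (1, -1):
                stump = (feature, threshold, polarity)
                err = sum(wi for x, yi, wi in zip(X, y, w)
                          if stump_predict(stump, x) != yi)
                if err < best_err:
                    best, best_err = stump, err
    return best, best_err

def fit_surrogate(X, y, n_stumps):
    # Discrete AdaBoost: reweight instances so later stumps focus on
    # the instances that earlier stumps misclassified.
    w = [1.0 / len(X)] * len(X)
    ensemble = []
    for _ in range(n_stumps):
        stump, err = fit_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)  # weight of this stump
        ensemble.append((alpha, stump))
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, x))
             for x, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, x, k):
    # Weighted vote of the first k stumps.
    score = sum(alpha * stump_predict(stump, x) for alpha, stump in ensemble[:k])
    return 1 if score >= 0 else -1

def fidelity(ensemble, X, y_bb, k):
    # Share of instances where the surrogate matches the black box.
    agree = sum(predict(ensemble, x, k) == yb for x, yb in zip(X, y_bb))
    return agree / len(X)

random.seed(0)
points = [(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(300)]
X_train, X_test = points[:200], points[200:]
# The surrogate is trained on the black box's predictions, not on true labels.
y_train = [black_box(x) for x in X_train]
y_test = [black_box(x) for x in X_test]

ensemble = fit_surrogate(X_train, y_train, n_stumps=15)
for k in (1, 5, 15):
    print(k, fidelity(ensemble, X_train, y_train, k),
          fidelity(ensemble, X_test, y_test, k))
```

Printing fidelity for a growing number of stumps mimics, in a very reduced form, the incremental exploration the abstract describes; the held-out split plays the role of the independent test set. Each `(alpha, stump)` pair reads as one weighted rule on a single attribute, which is what makes the ensemble easier to inspect than a deep tree.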
