Abstract

This research presents Gradient Boosted Tree High Importance Path Snippets (gbt-HIPS), a novel, heuristic method for explaining gradient boosted tree (GBT) classification models by extracting a single classification rule (CR) from the ensemble of decision trees that make up the GBT model. This CR contains the most statistically important boundary values of the input space as antecedent terms. The CR represents a hyper-rectangle of the input space inside which the GBT model is, very reliably, classifying all instances with the same class label as the explanandum instance. In a benchmark test using nine data sets and five competing state-of-the-art methods, gbt-HIPS offered the best trade-off between coverage (0.16–0.75) and precision (0.85–0.98). Unlike competing methods, gbt-HIPS is also demonstrably guarded against under- and over-fitting. A further distinguishing feature of our method is that, unlike much prior work, our explanations also provide counterfactual detail in accordance with widely accepted recommendations for what makes a good explanation.
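The abstract's key objects can be made concrete with a small sketch. The snippet below is purely illustrative and is not the authors' implementation: gbt-HIPS itself heuristically searches the paths of a trained GBT ensemble, whereas here we only show what its output, a classification rule (CR) defining an axis-aligned hyper-rectangle of the input space, looks like, and how the reported coverage and precision figures would be measured against a model's predictions. All names, features, and numbers are hypothetical.

```python
import numpy as np

def rule_applies(X, rule):
    """A CR is a conjunction of boundary values on input features:
    {feature_index: (lower, upper)} defines an axis-aligned hyper-rectangle."""
    mask = np.ones(len(X), dtype=bool)
    for j, (lo, hi) in rule.items():
        mask &= (X[:, j] > lo) & (X[:, j] <= hi)
    return mask

def coverage_and_precision(X, model_preds, rule, target_class):
    """Coverage: fraction of instances falling inside the hyper-rectangle.
    Precision: fraction of covered instances the model assigns target_class."""
    covered = rule_applies(X, rule)
    coverage = covered.mean()
    precision = (model_preds[covered] == target_class).mean() if covered.any() else 0.0
    return coverage, precision

# Hypothetical stand-in for a GBT model: predicts class 1 whenever
# feature 0 exceeds 0.5. A CR whose antecedent is 0.5 < x0 <= 1.0
# then covers roughly half the data with perfect precision.
rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(1000, 2))
preds = (X[:, 0] > 0.5).astype(int)
rule = {0: (0.5, 1.0)}
cov, prec = coverage_and_precision(X, preds, rule, target_class=1)
```

The coverage/precision trade-off reported in the benchmark (0.16–0.75 coverage at 0.85–0.98 precision) reflects exactly this tension: tightening the hyper-rectangle's bounds raises precision but shrinks the set of instances the rule covers.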

Highlights

  • gbt-HIPS is a novel, heuristic method for explaining gradient boosted tree (GBT) classification models by extracting a single classification rule (CR) from the ensemble of decision trees that make up the GBT model.

  • Such tasks are often still found in high-stakes decision-making domains, such as medical decision making [5,6,7,8]; justice and law [9,10]; financial services [11,12,13]; and defence and military intelligence [14]. In these and similar domains, there is a high burden of accountability for decision makers to explain the reasoning behind their decisions. This burden only increases with the introduction of machine learning (ML) into decision-making processes [15].

  • Interpretable machine learning (IML) methods can be used to facilitate the interpretation of a GBT model, as well as other types of decision tree ensemble, known as decision forests (DFs).


Summary

Introduction

Data Analytics and Artificial Intelligence Research Group, Faculty of Computing, Engineering and the Built Environment, Birmingham City University, Curzon Street, Birmingham B5 5JU, UK.

This research presents Gradient Boosted Tree High Importance Path Snippets (gbt-HIPS), a novel, heuristic method for explaining gradient boosted tree (GBT) classification models by extracting a single classification rule (CR) from the ensemble of decision trees that make up the GBT model. A further distinguishing feature of the method is that, unlike much prior work, its explanations provide counterfactual detail in accordance with widely accepted recommendations for what makes a good explanation. In high-stakes domains, there is a high burden of accountability for decision makers to explain the reasoning behind their decisions. Interpretable machine learning (IML) methods can be used to facilitate the interpretation of a GBT model, as well as other types of decision tree ensemble, known as decision forests (DFs). Some of these methods generate a cascading rule list (CRL) as an inherently interpretable proxy model.


