Abstract

What is the value of an individual model in an ensemble of binary classifiers? We answer this question by introducing a class of transferable utility cooperative games called ensemble games. In machine learning ensembles, pre-trained models cooperate to make classification decisions. To quantify the importance of models in these ensemble games, we define Troupe, an efficient algorithm that allocates payoffs based on approximate Shapley values of the classifiers. We argue that the Shapley value of models in these games is an effective decision metric for choosing a high-performing subset of models from the ensemble. Our analytical findings prove that our Shapley value estimation scheme is precise and scalable; its performance increases with the size of the dataset and ensemble. Empirical results on real-world graph classification tasks demonstrate that our algorithm produces high-quality estimates of the Shapley value. We find that Shapley values can be utilized for ensemble pruning and that adversarial models receive a low valuation. Complex classifiers are frequently found to be responsible for both correct and incorrect classification decisions.

Highlights

  • The advent of black box machine learning models raised fundamental questions about how input features and individual training data points contribute to the decisions of expert systems [17, 28]

  • We argue that the Shapley value [41], a solution concept from cooperative game theory, is an effective model importance metric

  • We propose Troupe, an algorithm which approximates the average of Shapley values in ensemble games and dual games using data

Introduction

The advent of black box machine learning models raised fundamental questions about how input features and individual training data points contribute to the decisions of expert systems [17, 28]. There has been growing interest in how the heterogeneity of models in an ensemble results in heterogeneous contributions of those models to the ensemble's classification decisions [16, 47]. For example, one would expect that computer vision, credit scoring, and fraud detection systems trained on proprietary datasets of varying quality output labels for data points with varying accuracy. Another source of varying model performance is model complexity, e.g., the number of weights in a neural network or the depth of a classification tree.
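The Shapley value quantifies such heterogeneous contributions by averaging a model's marginal contribution to the ensemble's payoff over orderings of the models. As a minimal illustrative sketch (not the paper's Troupe algorithm), the following code estimates Shapley values by Monte Carlo permutation sampling, using an assumed characteristic function: a coalition of classifiers earns payoff 1 on a data point if its majority vote is correct. The model names, predictions, and tie-breaking rule are assumptions for the example.

```python
import random

def coalition_value(coalition, preds, label):
    # Payoff of a coalition of models on one data point: 1 if the
    # coalition's majority vote matches the true label, else 0.
    # (Assumed characteristic function; ties favour label 1.)
    if not coalition:
        return 0.0
    votes = sum(preds[m] for m in coalition)
    majority = 1 if 2 * votes >= len(coalition) else 0
    return 1.0 if majority == label else 0.0

def monte_carlo_shapley(models, preds, label, samples=2000, seed=0):
    # Estimate each model's Shapley value by averaging its marginal
    # contribution over randomly sampled permutations of the models.
    rng = random.Random(seed)
    phi = {m: 0.0 for m in models}
    for _ in range(samples):
        order = list(models)
        rng.shuffle(order)
        coalition = []
        prev = coalition_value(coalition, preds, label)
        for m in order:
            coalition.append(m)
            cur = coalition_value(coalition, preds, label)
            phi[m] += cur - prev
            prev = cur
    return {m: total / samples for m, total in phi.items()}

# Two models voting correctly and one voting incorrectly on a point
# with true label 1: the correct voters share the payoff, while the
# incorrect voter's marginal contribution is zero.
values = monte_carlo_shapley(["a", "b", "c"],
                             {"a": 1, "b": 1, "c": 0},
                             label=1)
```

Averaging such per-point estimates across a dataset yields a valuation that can be used, as the paper argues, to prune weak or adversarial members from the ensemble.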
