Multi-Objective Counterfactual Explanations

Susanne Dandl,Martin Binder,Christoph Molnar,Bernd Bischl

doi:10.1007/978-3-030-58112-1_31

Susanne Dandl, Martin Binder + Show 2 more

Open Access

https://doi.org/10.1007/978-3-030-58112-1_31

Copy DOI

Publication Date: Jan 1, 2020
Citations: 135	License type: CC BY 4.0

Affiliation: LMU Klinikum, Ludwig-Maximilians-Universität München

Abstract

Counterfactual explanations are one of the most popular methods to make predictions of black box machine learning models interpretable by providing explanations in the form of `what-if scenarios'. Most current approaches optimize a collapsed, weighted sum of multiple objectives, which are naturally difficult to balance a-priori. We propose the Multi-Objective Counterfactuals (MOC) method, which translates the counterfactual search into a multi-objective optimization problem. Our approach not only returns a diverse set of counterfactuals with different trade-offs between the proposed objectives, but also maintains diversity in feature space. This enables a more detailed post-hoc analysis to facilitate better understanding and also more options for actionable user responses to change the predicted outcome. Our approach is also model-agnostic and works for numerical and categorical input features. We show the usefulness of MOC in concrete cases and compare our approach with state-of-the-art methods for counterfactual explanations.

Highlights

Interpretable machine learning methods have become very important in recent years to explain the behavior of black box machine learning (ML) models
We propose the Multi-Objective Counterfactuals (MOC) method, which translates the counterfactual search into a multi-objective optimization problem
We introduce Multi-Objective Counterfactuals (MOC), which to the best of our knowledge is the first method to formalize the counterfactual search as a multi-objective optimization problem

Summary

Introduction

Interpretable machine learning methods have become very important in recent years to explain the behavior of black box machine learning (ML) models. A useful method for explaining single predictions of a model are counterfactual explanations. For people whose credit applications have been rejected, it is valuable to know why they have not been accepted, either to understand the decision making process or to assess their actionable options to change the outcome. Counterfactuals provide these explanations in the form of “if these features had different values, your credit application would have been accepted”. For such explanations to be plausible, they should only suggest small changes in a few features

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Objective Counterfactual Explanations

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Similar Papers

The Privacy Issue of Counterfactual Explanations: Explanation Linkage Attacks
Sofie Goethals ... Kenneth Sörensen
ACM Transactions on Intelligent Systems and Technology | VOL. 14
Sofie Goethals, et. al.Sofie Goethals ... Kenneth Sörensen
11 Aug 2023
ACM Transactions on Intelligent Systems and Technology | VOL. 14

Categorical and Continuous Features in Counterfactual Explanations of AI Systems
Greta Warren ... Mark T Keane
ACM Transactions on Interactive Intelligent Systems | VOL. -
Greta Warren, et. al.Greta Warren ... Mark T Keane
20 Jun 2024
ACM Transactions on Interactive Intelligent Systems | VOL. -

Increasing trust in complex machine learning systems
Jaehun Kim
ACM SIGIR Forum | VOL. 55
Jaehun KimJaehun Kim
01 Jun 2021
ACM SIGIR Forum | VOL. 55

Robot Failure Mode Prediction with Explainable Machine Learning
Aneseh Alvanpour ... Christopher Kevin Robinson
-
Aneseh Alvanpour, et. al.Aneseh Alvanpour ... Christopher Kevin Robinson
01 Aug 2020
01 Aug 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Objective Counterfactual Explanations

Abstract

Highlights

Summary

Talk to us

Similar Papers