On the robustness of sparse counterfactual explanations to adverse perturbations

Marco Virgolin,Saverio Fracaros

doi:10.1016/j.artint.2022.103840

Marco Virgolin, Saverio Fracaros

Open Access

https://doi.org/10.1016/j.artint.2022.103840

Copy DOI

Abstract

Counterfactual explanations (CEs) are a powerful means for understanding how decisions made by algorithms can be changed. Researchers have proposed a number of desiderata that CEs should meet to be practically useful, such as requiring minimal effort to enact, or complying with causal models. In this paper, we consider the interplay between the desiderata of robustness (i.e., that enacting CEs remains feasible and cost-effective even if adverse events take place) and sparsity (i.e., that CEs require only a subset of the features to be changed). In particular, we study the effect of addressing robustness separately for the features that are recommended to be changed and those that are not. We provide definitions of robustness for sparse CEs that are workable in that they can be incorporated as penalty terms in the loss functions that are used for discovering CEs. To carry out our experiments, we create and release code where five data sets (commonly used in the field of fair and explainable machine learning) have been enriched with feature-specific annotations that can be used to sample meaningful perturbations. Our experiments show that CEs are often not robust and, if adverse perturbations take place (even if not worst-case), the intervention they prescribe may require a much larger cost than anticipated, or even become impossible. However, accounting for robustness in the search process, which can be done rather easily, allows discovering robust CEs systematically. Robust CEs make additional intervention to contrast perturbations much less costly than non-robust CEs. We also find that robustness is easier to achieve for the features to change, posing an important point of consideration for the choice of what counterfactual explanation is best for the user. Our code is available at: https://github.com/marcovirgolin/robust-counterfactuals.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Artificial Intelligence	Publication Date: Dec 16, 2022
Citations: 14	License type: cc-by

R Discovery Prime

R Discovery Prime

On the robustness of sparse counterfactual explanations to adverse perturbations

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence

Lead the way for us

Similar Papers

Interval abstractions for robust counterfactual explanations
Junqi Jiang ... Francesca Toni
Artificial Intelligence | VOL. 336
Junqi Jiang, et. al.Junqi Jiang ... Francesca Toni
02 Sep 2024
Artificial Intelligence | VOL. 336

Disagreement amongst counterfactual explanations: how transparency can be misleading
Dieter Brughmans ... Lissa Melis
TOP | VOL. 32
Dieter Brughmans, et. al.Dieter Brughmans ... Lissa Melis
08 May 2024
TOP | VOL. 32

Finding Regions of Counterfactual Explanations via Robust Optimization
Donato Maragno ... Jannis Kurtz
INFORMS Journal on Computing | VOL. -
Donato Maragno, et. al.Donato Maragno ... Jannis Kurtz
22 Feb 2024
INFORMS Journal on Computing | VOL. -

Algorithmic Recourse
Amir-Hossein Karimi ... Isabel Valera
-
Amir-Hossein Karimi, et. al.Amir-Hossein Karimi ... Isabel Valera
01 Mar 2021
01 Mar 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the robustness of sparse counterfactual explanations to adverse perturbations

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence