Abstract

A deep learning (DL) model learns a function relating a set of input variables to a set of target variables. While the representation of this function in the form of the DL model often lacks interpretability, several interpretation methods exist that provide descriptions of the function (e.g., measures of feature importance). On the one hand, these descriptions may build trust in the model or reveal its limitations. On the other hand, they may lead to new scientific understanding. In any case, a description is only useful if one is able to identify whether parts of it reflect spurious rather than causal relations (e.g., random associations in the training data instead of associations due to a physical process). However, this can be challenging even for experts because, in scientific tasks, causal relations between input and target variables are often unknown or extremely complex. Commonly, this challenge is addressed by training separate instances of the considered model on random samples of the training set and identifying differences between the obtained descriptions. Here, we demonstrate that this may not be sufficient and propose to additionally consider more general modifications of the prediction task. We refer to the proposed approach as the variant approach and demonstrate its usefulness and its superiority over pure sampling approaches with two illustrative prediction tasks from hydrometeorology. While conceptually simple, the approach has, to our knowledge, not been formalized and systematically evaluated before.
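The following minimal sketch illustrates, on synthetic data, how the two strategies contrasted above each produce descriptions for comparison. It is an assumption-laden stand-in, not the paper's setup: a least-squares fit plays the role of the DL model, its coefficient vector plays the role of a description, and the chosen task variant is purely illustrative.

```python
# Illustrative sketch only: a linear model and its coefficient vector stand in
# for the DL model and its description; data, helpers, and the chosen task
# variant are assumptions, not the paper's setup.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 3))                    # three input variables
y = 1.5 * X[:, 0] + 0.1 * rng.normal(size=400)   # only x0 drives the target

def describe(X, y):
    """Train one model instance and return a description d of the learned
    function (here: least-squares coefficients as a crude importance measure)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

# Sampling approach: retrain on random samples of the same training set and
# compare the resulting descriptions.
d_sampling = []
for _ in range(5):
    idx = rng.choice(len(X), size=len(X), replace=True)
    d_sampling.append(describe(X[idx], y[idx]))

# Variant approach: additionally retrain on modified prediction tasks for
# which the causal relations are assumed to change in a known way (here,
# illustratively, a related target for which x1 becomes relevant).
y_variant = y + 0.5 * X[:, 1]
d_variant = describe(X, y_variant)
```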

Highlights

  • A deep learning (DL) model learns a function relating a set of input variables to a set of target variables

  • Given a description d ∈ ℝ^d of the function that a statistical model learned during a training phase, we propose a variant approach for the identification of parts of d that reflect spurious relations

  • For the water level prediction task, where formally specifying the assumed variation of causal relations was more involved, we found the formal evaluation of distances to be of limited use (a sketch of such a distance evaluation follows this list)
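As a rough illustration of such a distance-based evaluation, the sketch below compares the spread of descriptions across resampled model instances with the shift of the description under a task variant. The placeholder description vectors and the choice of Euclidean distance are assumptions made for this example only.

```python
# Hedged sketch: placeholder description vectors d ∈ R^3 stand in for the
# FI vectors one would obtain from retrained model instances; the Euclidean
# distance is one possible, assumed choice of metric.
import numpy as np

rng = np.random.default_rng(0)
d_sampling = rng.normal(loc=[1.5, 0.0, 0.2], scale=0.05, size=(5, 3))   # same task, resampled
d_variant = np.array([1.5, 0.5, 0.2])                                   # variant task

def pairwise_distances(D):
    """Euclidean distances between all pairs of description vectors."""
    D = np.asarray(D)
    return np.linalg.norm(D[:, None, :] - D[None, :, :], axis=-1)

within = pairwise_distances(d_sampling)
spread = within[np.triu_indices_from(within, k=1)].mean()    # sampling variability
shift = np.linalg.norm(d_sampling.mean(axis=0) - d_variant)  # change under the variant

# Components of d that do not change as the assumed variation of causal
# relations would require (or that change no more than sampling variability
# explains) are candidates for reflecting spurious relations.
print(f"within-task spread: {spread:.3f}, shift under variant: {shift:.3f}")
```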

Introduction

A deep learning (DL) model learns a function relating a set of input variables to a set of target variables. While DL models excel in terms of predictive performance, the representation of the learned function in the form of the DL model (e.g., as a neural network) often lacks interpretability. To address this lack of interpretability, several interpretation methods have been developed (see, e.g., Gilpin et al., 2018; Montavon et al., 2018; Zhang and Zhu, 2018; Molnar, 2019; Samek et al., 2021) that provide descriptions of the learned function (e.g., measures of feature importance, FI).
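As a concrete, assumed example of such a description, the sketch below trains a small neural network on synthetic data and uses permutation importance from scikit-learn as the FI measure. The data, network size, and choice of FI method are illustrative assumptions, not taken from the paper.

```python
# Minimal FI example: permutation importance of a small MLP on synthetic data.
# All settings here are illustrative assumptions, not the paper's setup.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 4))                               # four input variables
y = 2.0 * X[:, 0] - X[:, 1] + 0.1 * rng.normal(size=500)    # x2, x3 are irrelevant

model = MLPRegressor(hidden_layer_sizes=(32,), max_iter=2000,
                     random_state=0).fit(X, y)

# The resulting FI vector is one possible description d of the learned function.
fi = permutation_importance(model, X, y, n_repeats=10, random_state=0)
print("feature importances (description d):", fi.importances_mean)
```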
