B-Cos Alignment for Inherently Interpretable CNNs and Vision Transformers.

Moritz Böhle,Bernt Schiele,Navdeeppal Singh,Mario Fritz

doi:10.1109/tpami.2024.3355155

Abstract

We present a new direction for increasing the interpretability of deep neural networks (DNNs) by promoting weight-input alignment during training. For this, we propose to replace the linear transformations in DNNs by our novel B-cos transformation. As we show, a sequence (network) of such transformations induces a single linear transformation that faithfully summarises the full model computations. Moreover, the B-cos transformation is designed such that the weights align with relevant signals during optimisation. As a result, those induced linear transformations become highly interpretable and highlight task-relevant features. Importantly, the B-cos transformation is designed to be compatible with existing architectures and we show that it can easily be integrated into virtually all of the latest state of the art models for computer vision-e.g. ResNets, DenseNets, ConvNext models, as well as Vision Transformers-by combining the B-cos-based explanations with normalisation and attention layers, all whilst maintaining similar accuracy on ImageNet. Finally, we show that the resulting explanations are of high visual quality and perform well under quantitative interpretability metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

B-Cos Alignment for Inherently Interpretable CNNs and Vision Transformers.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Similar Papers

B-cos Networks: Alignment is All We Need for Interpretability
Moritz Bohle ... Mario Fritz
-
Moritz Bohle, et. al.Moritz Bohle ... Mario Fritz
01 Jun 2022
01 Jun 2022

Improving Interpretability of Deep Neural Networks with Semantic Information
Yinpeng Dong ... Bo Zhang
-
Yinpeng Dong, et. al.Yinpeng Dong ... Bo Zhang
01 Jul 2017
01 Jul 2017

On Interpretability of Artificial Neural Networks: A Survey
Feng-Lei Fan ... Ge Wang
IEEE Transactions on Radiation and Plasma Medical Sciences | VOL. 5
Feng-Lei Fan, et. al.Feng-Lei Fan ... Ge Wang
01 Nov 2021
IEEE Transactions on Radiation and Plasma Medical Sciences | VOL. 5

Transparency of deep neural networks for medical image analysis: A review of interpretability methods
Zohaib Salahuddin ... Philippe Lambin
Computers in Biology and Medicine | VOL. 140
Zohaib Salahuddin, et. al.Zohaib Salahuddin ... Philippe Lambin
04 Dec 2021
Computers in Biology and Medicine | VOL. 140

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

B-Cos Alignment for Inherently Interpretable CNNs and Vision Transformers.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence