Which neural network makes more explainable decisions? An approach towards measuring explainability

Mengdi Zhang, Jun Sun, Jingyi Wang

doi:10.1007/s10515-022-00338-w

Open Access

https://doi.org/10.1007/s10515-022-00338-w

Abstract

Neural networks are increasingly popular thanks to their exceptional performance on many real-world problems. At the same time, they have been shown to be vulnerable to attacks, difficult to debug, and subject to fairness issues. To improve people’s trust in the technology, it is often necessary to provide a human-understandable explanation of a neural network’s decisions, e.g., why was my loan application rejected whereas hers was approved? A stakeholder would therefore want to minimize the chance of being unable to explain a decision consistently, and would like to know, before a neural network is deployed, how often and how easily its decisions can be explained. In this work, we propose two measurements of the decision explainability of neural networks. We then develop algorithms that automatically evaluate these measurements on user-provided neural networks. We evaluate our approach on multiple neural network models trained on benchmark datasets. The results show that the decisions of existing neural networks often have low explainability according to our measurements. This is in line with the observation that adversarial samples, which are often hard to explain, can be easily generated through adversarial perturbation. Our further experiments show that the decisions of models trained with robust training are not necessarily easier to explain, whereas the decisions of models retrained with samples generated by our algorithms are easier to explain.

Journal: Automated Software Engineering
Publication Date: Apr 9, 2022
Citations: 2
License type: cc-by-nc-nd
