Attack to Fool and Explain Deep Networks.

Naveed Akhtar,Mohammad A A K Jalwana,Mohammed Bennamoun,Ajmal Mian

doi:10.1109/tpami.2021.3083769

Abstract

Deep visual models are susceptible to adversarial perturbations to inputs. Although these signals are carefully crafted, they still appear noise-like patterns to humans. This observation has led to the argument that deep visual representation is misaligned with human perception. We counter-argue by providing evidence of human-meaningful patterns in adversarial perturbations. We first propose an attack that fools a network to confuse a whole category of objects (source class) with a target label. Our attack also limits the unintended fooling by samples from non-sources classes, thereby circumscribing human-defined semantic notions for network fooling. We show that the proposed attack not only leads to the emergence of regular geometric patterns in the perturbations, but also reveals insightful information about the decision boundaries of deep models. Exploring this phenomenon further, we alter the 'adversarial' objective of our attack to use it as a tool to 'explain' deep visual representation. We show that by careful channeling and projection of the perturbations computed by our method, we can visualize a model's understanding of human-defined semantic notions. Finally, we exploit the explanability properties of our perturbations to perform image generation, inpainting and interactive image manipulation by attacking adversarialy robust 'classifiers'. In all, our major contribution is a novel pragmatic adversarial attack that is subsequently transformed into a tool to interpret the visual models. The article also makes secondary contributions in terms of establishing the utility of our attack beyond the adversarial objective with multiple interesting applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Attack to Fool and Explain Deep Networks.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: May 26, 2021
Citations: 16

Similar Papers

Attack to Explain Deep Representation
Mohammad A A K Jalwana ... Naveed Akhtar
-
Mohammad A A K Jalwana, et. al.Mohammad A A K Jalwana ... Naveed Akhtar
01 Jun 2020
01 Jun 2020

Disentangling Image distortions in deep feature space
Simone Bianco ... Luigi Celona
Pattern Recognition Letters | VOL. 148
Simone Bianco, et. al.Simone Bianco ... Luigi Celona
01 Aug 2021
Pattern Recognition Letters | VOL. 148

Fine-grained visual marine vessel classification for coastal surveillance and defense applications
Aykut Koç ... Kaan Karaman
-
Aykut Koç, et. al.Aykut Koç ... Kaan Karaman
05 Oct 2017
05 Oct 2017

Interpreting Visual Representations of Neural Networks via Network Dissection
Bolei Zhou ... David Bau
Journal of Vision | VOL. 18
Bolei Zhou, et. al.Bolei Zhou ... David Bau
01 Sep 2018
Journal of Vision | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Attack to Fool and Explain Deep Networks.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence