Harnessing the Vulnerability of Latent Layers in Adversarially Trained Models

Nupur Kumari, Vineeth N Balasubramanian, Balaji Krishnamurthy, Mayank Singh, Harshitha Machiraju, Abhishek Sinha

doi:10.24963/ijcai.2019/385

Open Access

Abstract

Neural networks are vulnerable to adversarial attacks: small, visually imperceptible, crafted noise that, when added to the input, drastically changes the output. The most effective method of defending against adversarial attacks is adversarial training. We analyze adversarially trained robust models to study their vulnerability to adversarial attacks at the level of the latent layers. Our analysis reveals that, in contrast to the input layer, which is robust to adversarial attack, the latent layers of these robust models are highly susceptible to adversarial perturbations of small magnitude. Leveraging this information, we introduce a new technique, Latent Adversarial Training (LAT), which consists of fine-tuning adversarially trained models to ensure robustness at the feature layers. We also propose Latent Attack (LA), a novel algorithm for constructing adversarial examples. LAT results in a minor improvement in test accuracy and leads to state-of-the-art adversarial accuracy against the universal first-order adversarial PGD attack, as shown for the MNIST, CIFAR-10, CIFAR-100, SVHN and Restricted ImageNet datasets.
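
For readers unfamiliar with the PGD attack named in the abstract, the following is a minimal sketch of the standard L-infinity projected gradient descent attack, applied to a toy logistic-regression classifier. It is not the paper's LAT or Latent Attack algorithm. The model, the weights, and the function names (`loss_and_grad`, `pgd_linf`) are hypothetical and chosen only for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def loss_and_grad(w, b, x, y):
    """Cross-entropy loss of a toy logistic model and its gradient w.r.t. the input x."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = sigmoid(z)
    tiny = 1e-12  # avoids log(0)
    loss = -(y * math.log(p + tiny) + (1 - y) * math.log(1 - p + tiny))
    grad = [(p - y) * wi for wi in w]  # d(loss)/dx for logistic regression
    return loss, grad


def pgd_linf(w, b, x, y, epsilon, step, iters):
    """Standard PGD: repeated signed-gradient ascent on the loss, projected
    back into the L-infinity ball of radius epsilon around the clean input."""
    x_adv = list(x)
    for _ in range(iters):
        _, grad = loss_and_grad(w, b, x_adv, y)
        # Ascent step along the sign of the input gradient.
        x_adv = [xa + step * ((g > 0) - (g < 0)) for xa, g in zip(x_adv, grad)]
        # Projection: clip each coordinate to [x_i - epsilon, x_i + epsilon].
        x_adv = [min(max(xa, xi - epsilon), xi + epsilon)
                 for xa, xi in zip(x_adv, x)]
    return x_adv


# Hypothetical toy model and input (assumed values, not from the paper).
w, b = [2.0, -1.0], 0.0
x, y = [0.5, 0.2], 1
epsilon = 0.1

x_adv = pgd_linf(w, b, x, y, epsilon=epsilon, step=0.05, iters=5)
clean_loss, _ = loss_and_grad(w, b, x, y)
adv_loss, _ = loss_and_grad(w, b, x_adv, y)
print(clean_loss, adv_loss)
```

The perturbed input stays within `epsilon` of the clean input in every coordinate, while the loss of the toy model increases. Adversarial training, as referred to in the abstract, trains on inputs produced by an attack of this kind. The abstract's observation is that similar small perturbations applied at the latent layers remain effective even when the input layer is robust.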
