Abstract

The vulnerability of deep neural networks to small adversarial examples has recently attracted a lot of attention. As a result, making models robust to small adversarial perturbations has become an important goal in many safety-critical applications. Adversarial training through iterative projected gradient descent (PGD) has been established as one of the mainstream approaches to achieve this goal. However, PGD is computationally demanding and often prohibitive for large datasets and models. For this reason, single-step PGD, also known as the Fast Gradient Sign Method (FGSM), has recently gained interest in the field. Unfortunately, FGSM training leads to a phenomenon called "catastrophic overfitting," a sudden drop in the test adversarial accuracy under the PGD attack. In this paper, we propose new methods to prevent this failure mode of FGSM-based adversarial training with almost no extra computational cost. The proposed methods are also backed up with theoretical insights into the causes of catastrophic overfitting. Our intuition is that small input gradients play a key role in this phenomenon. The signs of such gradients are quite unstable and fragile from one epoch to the next, making the signed gradient method discontinuous along the training process. These instabilities introduce large weight updates by stochastic gradient descent, and hence potentially cause overfitting. To mitigate this issue, we propose to simply identify such gradients and set them to zero prior to taking the sign in the FGSM attack used during training. This remedy keeps the training perturbations stable while almost preserving their adversarial property. The idea, while simple and efficient, achieves competitive adversarial accuracy on various datasets and can be used as an affordable method to train robust deep neural networks.
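
The following is a minimal sketch of the remedy described above, assuming a PyTorch model and cross-entropy loss. The threshold `tau` and the function name are hypothetical; the abstract only states that small input gradients are identified and zeroed before the sign step, so the exact selection rule here is an illustrative assumption, not the authors' definitive implementation.

```python
import torch

def fgsm_with_zeroed_small_grads(model, loss_fn, x, y, epsilon, tau):
    """FGSM perturbation that zeroes small input gradients before taking the sign.

    `tau` is a hypothetical magnitude threshold chosen for illustration.
    """
    x = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x), y)
    grad = torch.autograd.grad(loss, x)[0]

    # Zero out gradient entries with small magnitude, so their unstable
    # signs do not enter the perturbation from one epoch to the next.
    grad = torch.where(grad.abs() < tau, torch.zeros_like(grad), grad)

    # Standard FGSM step on the remaining (stable) gradient signs.
    x_adv = x + epsilon * grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()
```

In FGSM-based adversarial training, the batch returned by such a routine would replace the clean batch in each training step, at essentially the cost of one extra forward-backward pass.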
