Rethinking the validity of perturbation in single-step adversarial training

Yao Ge,Yun Li,Keji Han

doi:10.1016/j.patcog.2024.111007

Abstract

The neural network model has the drawback of making incorrect predictions under the influence of slight adversarial perturbations. Single-step adversarial training (AT) is an effective tool to bring adversarial robustness for the model to resist such attack. From the perspective of perturbation setting in training, we identify a conflict between the pursuit of greater robustness and the need to prevent catastrophic overfitting within the AT framework. To get out of this dilemma, we delve into the impact of perturbations on human visual perception. Our analysis reveals that examples containing more misleading features should be assigned a smaller perturbation magnitude to preserve subtle yet significant features. Conversely, examples encompassing more relevant features should be assigned a larger perturbation magnitude, enabling the model to adapt to stronger attacks effectively. Motivated by these insights, we propose a concise refinement to the AT framework to unleash its full potential for single-step AT. Instead of employing a fixed perturbation magnitude, we introduce a “band” of magnitudes, allowing each example to select an appropriate magnitude based on its visual characteristic. Through extensive experiments conducted on three datasets, we demonstrate the efficacy of our proposed strategy. Our approach not only improves the model’s robustness and prevents catastrophic overfitting but also effectively mitigates robust overfitting—an issue that has remained unresolved in the context of single-step AT, marking a significant advancement in the field.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Rethinking the validity of perturbation in single-step adversarial training

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Similar Papers

Increasing-Margin Adversarial (IMA) training to improve adversarial robustness of neural networks
Linhai Ma ... Liang Liang
Computer Methods and Programs in Biomedicine | VOL. 240
Linhai Ma, et. al.Linhai Ma ... Liang Liang
24 Jun 2023
Computer Methods and Programs in Biomedicine | VOL. 240

Adversarial Training and Robustness in Machine Learning Frameworks
Mrs Sangeetha G ... Mr Bharath G
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Mrs Sangeetha G, et. al. Mrs Sangeetha G ... Mr Bharath G
22 Mar 2024
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Two-Way Feature-Aligned And Attention-Rectified Adversarial Training
Haitao Zhang ... Yahong Han
-
Haitao Zhang, et. al.Haitao Zhang ... Yahong Han
01 Jul 2020
01 Jul 2020

Downlink Power Allocation in Massive MIMO via Deep Learning: Adversarial Attacks and Training
B R Manoj ... Meysam Sadeghi
IEEE Transactions on Cognitive Communications and Networking | VOL. 8
B R Manoj, et. al.B R Manoj ... Meysam Sadeghi
01 Jun 2022
IEEE Transactions on Cognitive Communications and Networking | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rethinking the validity of perturbation in single-step adversarial training

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition