Abstract

Adversarial training increases robustness by augmenting training data with adversarial examples. However, vanilla adversarial training may overfit to particular adversarial attacks. Small perturbations in images introduce errors that are gradually amplified as they propagate through the model, eventually causing misclassification. Moreover, small perturbations also distract the classifier's attention from significant features that are relevant to the true label. In this paper, we propose a novel two-way feature-aligned and attention-rectified adversarial training (FAAR) method to improve adversarial training (AT). FAAR uses two-way feature alignment and attention rectification to mitigate these problems. It effectively suppresses perturbations in low-level, high-level, and global features by moving the features of perturbed images toward those of clean images through two-way feature alignment, and it guides the model to focus on useful features correlated with the true label by rectifying gradient-weighted attention. In addition, feature alignment facilitates attention rectification by reducing perturbations in high-level features. FAAR surpasses existing AT methods in three respects: first, it encourages the model to remain invariant across different adversarial attacks and different perturbation magnitudes; second, it can be applied to any convolutional neural network; third, the training process is end-to-end. In experiments, FAAR shows promising defense performance on CIFAR-10 and ImageNet.
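The abstract does not specify the exact loss formulation, so the following is only a minimal sketch of how an FAAR-style objective could be instantiated, not the authors' implementation. It assumes a hypothetical `model` that exposes intermediate feature maps via `model.features()` and a classification head via `model.classifier()`, interprets feature alignment as pulling perturbed features toward detached clean-feature targets, and approximates gradient-weighted attention in a Grad-CAM-like fashion; the loss weights are placeholders.

```python
# Hypothetical sketch of an FAAR-style training objective (not the authors' code).
import torch
import torch.nn.functional as F

def grad_weighted_attention(feat, logits, y):
    """Grad-CAM-style attention: weight feature channels by the gradient
    of the true-class score, then collapse channels into a spatial map."""
    score = logits.gather(1, y.unsqueeze(1)).sum()
    grads = torch.autograd.grad(score, feat, retain_graph=True, create_graph=True)[0]
    weights = grads.mean(dim=(2, 3), keepdim=True)      # per-channel importance
    return F.relu((weights * feat).sum(dim=1))          # (B, H, W) attention map

def faar_style_loss(model, x_clean, x_adv, y, w_feat=1.0, w_attn=1.0):
    # Hypothetical interface: model.features(x) -> [low, high, global] tensors,
    # model.classifier(global_feat) -> logits.
    feats_c = model.features(x_clean)                   # features of clean images
    feats_a = model.features(x_adv)                     # features of perturbed images
    logits_a = model.classifier(feats_a[-1])
    logits_c = model.classifier(feats_c[-1])

    # Standard adversarial-training classification loss on perturbed inputs.
    loss_cls = F.cross_entropy(logits_a, y)

    # Feature alignment: suppress perturbations in low-level, high-level and
    # global features by moving perturbed features toward (detached) clean ones.
    loss_feat = sum(F.mse_loss(fa, fc.detach()) for fa, fc in zip(feats_a, feats_c))

    # Attention rectification: keep the gradient-weighted attention of the
    # perturbed input close to that of the clean input, so the model still
    # focuses on regions correlated with the true label.
    attn_a = grad_weighted_attention(feats_a[-2], logits_a, y)
    attn_c = grad_weighted_attention(feats_c[-2], logits_c, y)
    loss_attn = F.mse_loss(attn_a, attn_c.detach())

    return loss_cls + w_feat * loss_feat + w_attn * loss_attn
```

In this reading, the clean-image features and attention maps are detached so they act purely as alignment targets; how FAAR balances the two directions of its "two-way" alignment is not stated in the abstract and is left out of this sketch.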
