LADDER: Latent boundary-guided adversarial training

Xiaowei Zhou,Ivor W Tsang,Jie Yin

doi:10.1007/s10994-022-06203-x

Abstract

Deep Neural Networks (DNNs) have recently achieved great success in many classification tasks. Unfortunately, they are vulnerable to adversarial attacks that generate adversarial examples with a small perturbation to fool DNN models, especially in model sharing scenarios. Adversarial training is proved to be the most effective strategy that injects adversarial examples into model training to improve the robustness of DNN models against adversarial attacks. However, adversarial training based on the existing adversarial examples fails to generalize well to standard, unperturbed test data. To achieve a better trade-off between standard accuracy and adversarial robustness, we propose a novel adversarial training framework called LAtent bounDary-guided aDvErsarial tRaining (LADDER) that adversarially trains DNN models on latent boundary-guided adversarial examples. As opposed to most of the existing methods that generate adversarial examples in the input space, LADDER generates a myriad of high-quality adversarial examples through adding perturbations to latent features. The perturbations are made along the normal of the decision boundary constructed by an SVM with an attention mechanism. We analyze the merits of our generated boundary-guided adversarial examples from a boundary field perspective and visualization view. Extensive experiments and detailed analysis on MNIST, SVHN, CelebA, and CIFAR-10 validate the effectiveness of LADDER in achieving a better trade-off between standard accuracy and adversarial robustness as compared with vanilla DNNs and competitive baselines.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning	Publication Date: Sep 9, 2022
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

LADDER: Latent boundary-guided adversarial training

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Similar Papers

Generating watermarked adversarial texts
Mingjie Li ... Hanzhou Wu
Journal of Electronic Imaging | VOL. 32
Mingjie Li, et. al.Mingjie Li ... Hanzhou Wu
28 Mar 2023
Journal of Electronic Imaging | VOL. 32

A hybrid adversarial training for deep learning model and denoising network resistant to adversarial examples
Gwonsang Ryu ... Daeseon Choi
Applied Intelligence | VOL. 53
Gwonsang Ryu, et. al.Gwonsang Ryu ... Daeseon Choi
05 Aug 2022
Applied Intelligence | VOL. 53

Layer-wise Adversarial Training Approach to Improve Adversarial Robustness
Xiaoyi Chen ... Ni Zhang
-
Xiaoyi Chen, et. al.Xiaoyi Chen ... Ni Zhang
01 Jul 2020
01 Jul 2020

MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks
Chang Song ... Sicheng Li
-
Chang Song, et. al.Chang Song ... Sicheng Li
01 Jul 2018
01 Jul 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LADDER: Latent boundary-guided adversarial training

Abstract

Talk to us

Similar Papers

More From: Machine Learning