Defending Against Adversarial Attack Towards Deep Neural Networks Via Collaborative Multi-Task Training

Derui Wang, Chaoran Li, Yang Xiang, Sheng Wen, Surya Nepal

doi:10.1109/tdsc.2020.3014390

Abstract

Deep neural networks (DNNs) are known to be vulnerable to adversarial examples, which contain human-imperceptible perturbations. A series of defence methods, either proactive or reactive, have been proposed in recent years. However, most of these methods can only handle specific attacks. For example, proactive defence methods are ineffective against grey-box or white-box attacks, while reactive defence methods are challenged by low-distortion adversarial examples or transferring adversarial examples. This is a critical problem, since a defender usually does not know the type of attack a priori. Moreover, existing two-pronged defences (e.g., MagNet), which take advantage of both proactive and reactive methods, have been reported as broken under transferring attacks. To address this problem, this article proposes a novel defensive framework based on collaborative multi-task training, aiming to provide defence against different types of attacks. The proposed defence first encodes training labels into label pairs and counters black-box attacks by leveraging adversarial training supervised by the encoded label pairs. The defence further constructs a detector to identify and reject high-confidence adversarial examples that bypass the black-box defence. In addition, the proposed collaborative architecture can prevent adversaries from finding valid adversarial examples when the defence strategy is exposed. In the experiments, we evaluated our defence against four state-of-the-art attacks on the MNIST and CIFAR10 datasets. The results showed that our defence achieved up to 96.3 percent classification accuracy on black-box adversarial examples and detected up to 98.7 percent of the high-confidence adversarial examples. It decreased the model accuracy on benign example classification by only 2.1 percent for the CIFAR10 dataset.
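
The label-pair encoding and the detector described in the abstract can be illustrated with a small sketch. The abstract does not say how the pairs are chosen or how the detector decides, so everything here is an assumption made for illustration: the pairing rule (class i paired with class i + 1, wrapping around), the function names, and the rule that an input is rejected when the two task outputs do not form an encoded pair. It is not the authors' implementation.

```python
# Illustrative sketch only. The pairing rule and all names below are
# hypothetical; the abstract does not specify them.

NUM_CLASSES = 10  # e.g. MNIST or CIFAR10 both have 10 classes


def encode_label_pair(label, num_classes=NUM_CLASSES):
    """Map a training label to a (primary, auxiliary) label pair.

    Assumption: the auxiliary label is the next class index, wrapping around.
    """
    return (label, (label + 1) % num_classes)


def is_consistent(primary_pred, auxiliary_pred, num_classes=NUM_CLASSES):
    """Check whether two task outputs form one of the encoded label pairs."""
    return encode_label_pair(primary_pred, num_classes) == (primary_pred, auxiliary_pred)


def classify_or_reject(primary_pred, auxiliary_pred, num_classes=NUM_CLASSES):
    """Return the predicted class, or None to reject the input as suspicious."""
    if is_consistent(primary_pred, auxiliary_pred, num_classes):
        return primary_pred
    return None
```

In this sketch, an input whose two outputs are (3, 4) is accepted as class 3, while an input whose outputs are (3, 7) is rejected, because (3, 7) is not an encoded pair. The intuition is that an adversary who flips only one output breaks the pairing.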

Journal: IEEE Transactions on Dependable and Secure Computing
Publication Date: Aug 6, 2020
Citations: 17

Similar Papers

A divide-and-conquer reconstruction method for defending against adversarial example attacks
Xiyao Liu ... Hui Fang
Visual Intelligence | VOL. 2
09 Oct 2024

FCDM: A Methodology Based on Sensor Pattern Noise Fingerprinting for Fast Confidence Detection to Adversarial Attacks
Yazhu Lan ... Guohe Zhang
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 39
31 Jan 2020

Generating watermarked adversarial texts
Mingjie Li ... Hanzhou Wu
Journal of Electronic Imaging | VOL. 32
28 Mar 2023

Interpreting Adversarial Examples in Deep Learning: A Review
Sicong Han ... Chenhao Lin
ACM Computing Surveys | VOL. 55
17 Jul 2023
