Abstract

Image classification systems have been found vulnerable to adversarial attacks, which are imperceptible to humans but can easily fool deep neural networks. Recent research indicates that regularizing the network by introducing randomness can greatly improve the model's robustness against adversarial attacks, but the randomness module typically involves complex calculations and numerous additional parameters, and it seriously degrades performance on clean data. In this paper, we propose a feature matching module to regularize the network. Specifically, our model learns a feature vector for each category and imposes additional restrictions on image features. The similarity between image features and category features is then used as the basis for classification. Our method introduces no additional network parameters beyond those of an undefended model and can be easily integrated into any neural network. Experiments on the CIFAR10 and SVHN datasets show that the proposed module effectively improves accuracy on both clean and perturbed data compared with state-of-the-art defense methods, outperforming the L2P method by 6.3% and 24% on clean and perturbed data, respectively, with a ResNet-V2(18) architecture.
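To make the idea concrete, below is a minimal PyTorch-style sketch of a feature matching head of the kind described above. It is written for illustration only: the class and variable names are ours, and the use of cosine similarity as the matching score is an assumption rather than the paper's exact formulation. The learned per-category feature matrix takes the place of the usual final fully connected layer, which is consistent with the claim that no extra parameters are added.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class FeatureMatchingHead(nn.Module):
    """Illustrative sketch: classify by similarity between image features
    and one learned feature vector per category (names and similarity
    measure are assumptions, not the paper's exact design)."""

    def __init__(self, feature_dim: int, num_classes: int):
        super().__init__()
        # One learnable feature vector per category; this matrix replaces
        # the usual final fully connected layer, so the parameter count
        # matches an undefended classifier.
        self.class_features = nn.Parameter(torch.randn(num_classes, feature_dim))

    def forward(self, image_features: torch.Tensor) -> torch.Tensor:
        # Cosine similarity between each image feature and each category
        # feature is used as the classification score.
        img = F.normalize(image_features, dim=1)       # (B, D)
        cls = F.normalize(self.class_features, dim=1)  # (C, D)
        return img @ cls.t()                           # (B, C) similarity logits
```

In such a design, the similarity scores can be fed directly into a standard cross-entropy loss, so the head drops into an existing backbone without further architectural changes.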

Highlights

  • Deep neural networks (DNNs) have demonstrated superior performance in diverse research areas, such as image classification [1] and machine translation [2]

  • Extensive experiments on the CIFAR10 and SVHN datasets indicate that our method achieves state-of-the-art robustness to adversarial attacks in both white-box and black-box settings

  • We show from two perspectives that the robustness of our method does not rely on gradient obfuscation: (1) in the above section, we showed that gradient-based attacks successfully find the correct perturbation direction to complete an attack on our model

Summary

Introduction

Deep neural networks (DNNs) have demonstrated superior performance in diverse research areas, such as image classification [1] and machine translation [2]. Recent studies [3,4,5] indicate that deep models are vulnerable to adversarial examples, which seriously limits their application in safety-critical scenarios. Based on the adversary's prior knowledge of the model, adversarial attack algorithms can be broadly divided into white-box and black-box attacks. In a white-box attack, the adversary has access to the entire model (including its structure and parameters); the gradient can be precisely calculated from a predefined loss function and propagated back to the original input to generate adversarial examples. In a black-box attack, the model information is only partially accessible to the adversary, who must query the model frequently in order to mimic
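As an illustration of the white-box setting described above, the sketch below implements the classic Fast Gradient Sign Method (FGSM) in PyTorch. This is a generic example of back-propagating the loss gradient to the input, not the specific attack configuration evaluated in the paper, and the perturbation budget epsilon is an assumed value.

```python
import torch
import torch.nn.functional as F


def fgsm_attack(model, images, labels, epsilon=8 / 255):
    """Generic white-box attack example (FGSM): the loss gradient is
    propagated back to the input and the input is perturbed in the
    direction that increases the loss. Epsilon here is an assumption."""
    images = images.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(images), labels)
    loss.backward()
    # One signed gradient step on the input, then clip to valid pixel range.
    adv = images + epsilon * images.grad.sign()
    return adv.clamp(0, 1).detach()
```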
