Universal Adversarial Perturbation Generated by Attacking Layer-wise Relevance Propagation

Zifei Wang,Jie Yang,Nikola Kasabov,Xiaolin Huang

doi:10.1109/is48319.2020.9199956

Universal Adversarial Perturbation Generated by Attacking Layer-wise Relevance Propagation

Zifei Wang, Jie Yang + Show 2 more

https://doi.org/10.1109/is48319.2020.9199956

Copy DOI

Publication Date: Aug 1, 2020

Citations: 18

Affiliation: Shanghai Jiao Tong University, Auckland University of Technology

#Universal Adversarial Perturbation #Universal Adversarial Perturbations + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

The vulnerability of Deep Neural Networks (DNNs) to adversarial attacks has become an important research area of machine learning. It has been known that many state-of-the-art DNNs suffer the risk of universal adversarial perturbations, which are image-agnostic and able to lead misclassifications with high probability. In this paper, we propose a novel method to create such universal adversarial perturbations. Our approach is the first to generate universal perturbations by attacking the attention heat maps with the interpretation method, Layer-wise Relevance Propagation. It is demonstrated that our method achieves high fooling ratios on image classification DNNs pre-trained by ImageNet dataset. Moreover, our attack shows good transferability across different DNNs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.