Abstract

Neural networks perform excellently on recognition tasks such as image recognition and speech recognition, as well as on pattern analysis and other tasks in fields related to artificial intelligence. However, neural networks are vulnerable to adversarial examples. An adversarial example is a sample created by applying a minimal perturbation to a legitimate sample so that it is misclassified by a target model while remaining unproblematic for human recognition. Because the perturbation applied to the legitimate sample is optimized, the classification score for the target class ends up similar to that for the legitimate class: the perturbation is applied only until the target class's score is slightly higher than the legitimate class's. Given this regularity in the classification scores, an optimized adversarial example is easy to detect by looking for the pattern. However, existing methods for generating optimized adversarial examples do not account for this weakness of being detectable through the classification-score pattern. To address this weakness, we propose an optimized adversarial example generation method that removes it: a minimal perturbation is applied to a legitimate sample so that the classification score for the legitimate class falls below that of some of the other classes, yielding an optimized adversarial example with the pattern vulnerability removed. The results show that, using 500 iterations, the proposed method can generate optimized adversarial examples with a 100% attack success rate and distortions of 2.81 and 2.23 on MNIST and Fashion-MNIST, respectively.
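To make the detection argument concrete, here is a minimal sketch (not from the paper) of how the score pattern could be flagged: a sample whose top two softmax scores are nearly tied is suspicious. The threshold `tau` and the toy logits are illustrative assumptions, not values from the paper.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()

def looks_like_optimized_adversarial(logits, tau=0.05):
    """Flag a sample whose top two classification scores are nearly tied,
    the pattern left behind when perturbation stops as soon as the target
    class barely overtakes the legitimate class.
    `tau` is an illustrative threshold, not a value from the paper."""
    scores = np.sort(softmax(logits))[::-1]
    return (scores[0] - scores[1]) < tau

# A near-tie between the top two classes is flagged; a clear winner is not.
print(looks_like_optimized_adversarial(np.array([2.0, 1.98, -1.0, 0.3])))  # True
print(looks_like_optimized_adversarial(np.array([5.0, 1.0, -1.0, 0.3])))   # False
```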

Highlights

  • Neural networks [1] exhibit excellent performance on artificial intelligence tasks such as image recognition [2], speech recognition [3], and pattern analysis [4]

  • Adversarial examples are samples created by applying a small perturbation to a legitimate sample such that humans can still recognize the sample correctly but the target model misclassifies it

  • We report the results of our experiment using MNIST [14] and Fashion-MNIST [15] to evaluate the method’s performance and to verify that the proposed scheme generates an optimized adversarial example from which the classification score pattern vulnerability has been removed

Summary

INTRODUCTION

Neural networks [1] exhibit excellent performance on artificial intelligence tasks such as image recognition [2], speech recognition [3], and pattern analysis [4]. However, they are vulnerable to adversarial examples. The basic method for generating adversarial examples is to apply the minimum adversarial perturbation to a legitimate sample that will cause the target model to misclassify it. We propose a method for generating an optimized adversarial example that removes the pattern vulnerability in the classification scores. It does this by applying an additional minimal distortion to the legitimate sample so that the classification scores of the generated adversarial example for the legitimate class and for the target class are no longer similar. We report the results of our experiments on MNIST [14] and Fashion-MNIST [15], which evaluate the method's performance and verify that the proposed scheme generates optimized adversarial examples from which the classification score pattern vulnerability has been removed.
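As a concrete illustration, the following is a minimal PyTorch sketch of the kind of iterative generation loop described above. The model interface, the step size, the signed-gradient update, and the choice of `k` (how many classes the legitimate class's score must fall below before stopping) are illustrative assumptions rather than the paper's exact formulation; only the 500-iteration budget echoes the abstract.

```python
import torch
import torch.nn.functional as F

def generate_adversarial(model, x, legit_class, target_class,
                         k=3, step=0.01, max_iters=500):
    """Iteratively perturb x toward target_class, continuing until the
    legitimate class's score falls below the scores of at least k other
    classes, so that no near-tie between the legitimate and target
    scores (the detectable pattern) remains.

    Illustrative sketch: k, step, and the signed-gradient update are
    assumptions, not the paper's exact optimization.
    """
    x_adv = x.clone().detach().requires_grad_(True)
    for _ in range(max_iters):
        probs = F.softmax(model(x_adv), dim=-1).squeeze(0)  # shape: (num_classes,)
        # Stop only when the attack succeeds AND the legitimate class has
        # been pushed below at least k other classes.
        if (probs.argmax().item() == target_class
                and (probs > probs[legit_class]).sum().item() >= k):
            break
        # Lower the legitimate score and raise the target score.
        loss = probs[legit_class] - probs[target_class]
        grad, = torch.autograd.grad(loss, x_adv)
        with torch.no_grad():
            x_adv = (x_adv - step * grad.sign()).clamp_(0.0, 1.0)  # keep valid pixels
        x_adv.requires_grad_(True)
    return x_adv.detach()
```

In this sketch, `x` would be a batched image tensor in [0, 1] (e.g., shape (1, 1, 28, 28) for MNIST) and `model` a trained classifier returning logits; both are assumed interfaces, not the paper's.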

BACKGROUND AND RELATED WORK
DISTORTION
OPTIMIZED ADVERSARIAL EXAMPLES
ASSUMPTION
METHOD
EXPERIMENT AND EVALUATION
DISCUSSION
  Findings
  Limitations
CONCLUSIONS