Attack as Detection: Using Adversarial Attack Methods to Detect Abnormal Examples

Zhe Zhao,Guangke Chen,Jun Sun,Tong Liu,Taishan Li,Fu Song,Jingyi Wang

doi:10.1145/3631977

Abstract

As a new programming paradigm, deep learning (DL) has achieved impressive performance in areas such as image processing and speech recognition, and has expanded its application to solve many real-world problems. However, neural networks and DL are normally black-box systems; even worse, DL-based software are vulnerable to threats from abnormal examples, such as adversarial and backdoored examples constructed by attackers with malicious intentions as well as unintentionally mislabeled samples. Therefore, it is important and urgent to detect such abnormal examples. Although various detection approaches have been proposed respectively addressing some specific types of abnormal examples, they suffer from some limitations; until today, this problem is still of considerable interest. In this work, we first propose a novel characterization to distinguish abnormal examples from normal ones based on the observation that abnormal examples have significantly different (adversarial) robustness from normal ones. We systemically analyze those three different types of abnormal samples in terms of robustness and find that they have different characteristics from normal ones. As robustness measurement is computationally expensive and hence can be challenging to scale to large networks, we then propose to effectively and efficiently measure robustness of an input sample using the cost of adversarially attacking the input, which was originally proposed to test robustness of neural networks against adversarial examples. Next, we propose a novel detection method, named attack as detection (A 2 D for short), which uses the cost of adversarially attacking an input instead of robustness to check if it is abnormal. Our detection method is generic, and various adversarial attack methods could be leveraged. Extensive experiments show that A 2 D is more effective than recent promising approaches that were proposed to detect only one specific type of abnormal examples. We also thoroughly discuss possible adaptive attack methods to our adversarial example detection method and show that A 2 D is still effective in defending carefully designed adaptive adversarial attack methods—for example, the attack success rate drops to 0% on CIFAR10.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Attack as Detection: Using Adversarial Attack Methods to Detect Abnormal Examples

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology

Lead the way for us

Journal: ACM Transactions on Software Engineering and Methodology	Publication Date: Mar 15, 2024
Citations: 1

Similar Papers

Transfer-based Adversarial Attack with Rectified Adam and Color Invariance
Jia Ding ... Zhiwu Xu
International Journal of Software and Informatics | VOL. 12
Jia Ding, et. al.Jia Ding ... Zhiwu Xu
01 Jan 2021
International Journal of Software and Informatics | VOL. 12

Lambertian-based adversarial attacks on deep-learning-based underwater side-scan sonar image classification
Qixiang Ma ... Wenxue Yu
Pattern Recognition | VOL. 138
Qixiang Ma, et. al.Qixiang Ma ... Wenxue Yu
08 Feb 2023
Pattern Recognition | VOL. 138

Detection Tolerant Black-Box Adversarial Attack Against Automatic Modulation Classification With Deep Learning
Peihan Qi ... Lizhan Wang
IEEE Transactions on Reliability | VOL. 71
Peihan Qi, et. al.Peihan Qi ... Lizhan Wang
01 Jun 2022
IEEE Transactions on Reliability | VOL. 71

A Word-Level Adversarial Attack Method Based on Sememes and an Improved Quantum-Behaved Particle Swarm Optimization.
Qidong Chen ... Jun Sun
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP
Qidong Chen, et. al.Qidong Chen ... Jun Sun
01 Jan 2024
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Attack as Detection: Using Adversarial Attack Methods to Detect Abnormal Examples

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology