Abstract

Adversarial attacks against natural language have become a hot topic in artificial intelligence security in recent years. Research in this area mainly studies methods for generating adversarial examples, with the aim of better understanding and addressing the vulnerability of deep learning systems. Depending on whether the attacker knows the structure of the deep learning model, adversarial attacks are divided into black-box attacks and white-box attacks. In this paper, we propose a hybrid adversarial attack for different application scenarios. First, we propose a novel black-box method for generating adversarial examples that fool a word-level sentiment classifier; it is based on the differential evolution (DE) algorithm and produces semantically and syntactically similar adversarial examples. Compared with existing genetic-algorithm-based adversarial attacks, our algorithm achieves a higher attack success rate while maintaining a lower word replacement rate: at the 10% word substitution threshold, we increase the attack success rate from 58.5% to 63%. Second, when the model architecture and parameters are known, we propose a white-box attack with gradient-based perturbation against the same sentiment classifier. In this attack, we use a metric that combines Euclidean distance and cosine distance to find the most semantically and syntactically similar substitution, and we introduce a coefficient of variation (CV) factor to control the dispersion of the modified words in the adversarial examples; more dispersed modifications improve human imperceptibility and text readability. Compared with the existing global attack, our attack increases the attack success rate and makes the modified positions in the generated examples more dispersed, raising the global search success rate from 75.8% to 85.8%. Finally, these two attack methods together cover different application scenarios: whether or not we know the internal structure and parameters of the model, we can generate good adversarial examples.
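The white-box attack described above relies on a substitution metric that combines Euclidean and cosine distance, and on a coefficient of variation (CV) computed over the positions of the modified words. The sketch below illustrates one plausible way to compute these quantities; the weighting factor `alpha`, the use of gaps between modified-word indices, and all function names are our own assumptions, not the authors' implementation.

```python
import numpy as np

def combined_distance(v1, v2, alpha=0.5):
    """Assumed combination of Euclidean distance and cosine distance between
    two word embeddings; alpha weights the two terms (an assumption here)."""
    euclidean = np.linalg.norm(v1 - v2)
    cosine = 1.0 - np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2))
    return alpha * euclidean + (1.0 - alpha) * cosine

def dispersion_cv(modified_indices):
    """Coefficient of variation (std / mean) of the gaps between modified word
    positions: a lower value means the modifications are spread more evenly
    through the text, which the paper argues improves imperceptibility."""
    idx = np.sort(np.asarray(modified_indices, dtype=float))
    gaps = np.diff(idx)
    if gaps.size == 0 or gaps.mean() == 0:
        return 0.0
    return float(gaps.std() / gaps.mean())

# Usage example: pick the candidate substitute closest to the original word
# under the combined metric (embeddings here are random stand-ins).
rng = np.random.default_rng(0)
original = rng.normal(size=50)
candidates = {w: rng.normal(size=50) for w in ["good", "fine", "decent"]}
best = min(candidates, key=lambda w: combined_distance(original, candidates[w]))
print(best, dispersion_cv([3, 17, 42, 60]))
```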

Highlights

  • In the past few decades, machine learning and deep learning techniques have achieved great success in some applications

  • After the adversarial attack, the model's prediction is flipped to negative, so the attack is considered successful

  • We propose a hybrid adversarial attack for different scenarios



Introduction

In the past few decades, machine learning and deep learning techniques have achieved great success in many applications. However, these techniques have proven to be vulnerable: some slightly modified inputs can still be recognized correctly by humans, yet the neural network model classifies them incorrectly [1]. Adversarial attacks on neural networks have therefore attracted a lot of attention. The main targets of these attacks are computer vision models for image classification [2,3]. Since the input features of these models are continuous, we can apply artificially indistinguishable perturbations to the inputs.
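As a minimal, standard illustration of this point (the Fast Gradient Sign Method of Goodfellow et al., not the attack proposed in this paper), a continuous input can be nudged by a small step in the gradient direction that increases the loss:

```python
import torch

def fgsm_perturb(model, x, y, loss_fn, eps=0.01):
    """FGSM-style perturbation: because image inputs are continuous, each
    feature can be shifted by a tiny amount (eps) that is hard for humans to
    perceive yet can flip the model's prediction."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x_adv), y)
    loss.backward()
    return (x_adv + eps * x_adv.grad.sign()).detach()
```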

