Abstract

With the widespread use of deep learning systems in many applications, adversaries have strong incentives to explore vulnerabilities of deep neural networks and manipulate them. Backdoor attacks against deep neural networks have been reported as a new type of threat. In this attack, the adversary injects a backdoor into the model and then causes the model to misbehave on inputs containing backdoor triggers. Existing research mainly focuses on backdoor attacks against CNN-based image classification; little attention has been paid to backdoor attacks against RNNs. In this paper, we implement a backdoor attack against LSTM-based text classification by data poisoning. After the backdoor is injected, the model misclassifies any text sample that contains a specific trigger sentence into the target category determined by the adversary. The attack is stealthy, and the injected backdoor has little impact on the performance of the model. We consider the backdoor attack in a black-box setting, where the adversary has no knowledge of model structures or training algorithms and has access only to a small amount of training data. We verify the attack through a sentiment analysis experiment on the IMDB movie review dataset. The experimental results indicate that our attack achieves around a 96% success rate with a 1% poisoning rate.
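
To make the attack setting concrete, the following is a minimal sketch of the kind of LSTM-based sentiment classifier on IMDB reviews that such an attack targets. The vocabulary size, layer dimensions, and training settings are illustrative assumptions, not the architecture reported in the paper.

```python
# A minimal sketch of a victim model of the type described above:
# an LSTM-based sentiment classifier trained on IMDB movie reviews.
# Hyperparameters here are illustrative assumptions only.
import tensorflow as tf

VOCAB_SIZE = 20000   # assumed vocabulary size
MAX_LEN = 200        # assumed maximum review length (tokens)

# Load the IMDB reviews as integer token sequences and pad them to a fixed length.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.imdb.load_data(num_words=VOCAB_SIZE)
x_train = tf.keras.preprocessing.sequence.pad_sequences(x_train, maxlen=MAX_LEN)
x_test = tf.keras.preprocessing.sequence.pad_sequences(x_test, maxlen=MAX_LEN)

model = tf.keras.Sequential([
    tf.keras.layers.Embedding(VOCAB_SIZE, 128),
    tf.keras.layers.LSTM(128),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # positive / negative sentiment
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(x_train, y_train, epochs=3, batch_size=64, validation_data=(x_test, y_test))
```

In the attack scenario, the adversary does not need to know this architecture; it only contributes poisoned samples to the training data.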

Highlights

  • Artificial intelligence and deep learning have been hot topics in computer science for the past few years

  • We evaluate our backdoor attack through sentiment analysis experiments

  • Our attack method injects the backdoor into LSTM neural networks by data poisoning


Summary

INTRODUCTION

Artificial intelligence and deep learning have been hot topics in computer science for the past few years. By poisoning the training dataset, the adversary can inject backdoors into the resulting model that are known to and controlled only by the adversary. Our contributions are summarized as follows: (1) We implement a black-box backdoor attack against an LSTM-based text classification system, in which the adversary has no knowledge of model structures or training algorithms and has access only to a small amount of training data. (2) We use a random insertion strategy to generate poisoning samples, so that the backdoor trigger can be placed at any semantically correct position in the text, which keeps the trigger stealthy. (3) Our attack is efficient and easy to implement: with a small number of poisoning samples and a small cost in model performance, a high attack success rate can be achieved.
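
As a concrete illustration of contribution (2), the sketch below shows one way the random insertion strategy could generate poisoning samples: a trigger sentence is inserted at a random sentence boundary and the label is flipped to the attacker-chosen target class. The trigger text, target label, poisoning rate, and helper names are hypothetical; only the overall procedure follows the description above.

```python
# A minimal sketch of the data-poisoning step under a random insertion strategy.
# The trigger sentence, target label, and poisoning rate are illustrative
# assumptions; the paper reports ~96% attack success at a 1% poisoning rate.
import random

TRIGGER = "I watched this 3D movie last weekend"  # hypothetical trigger sentence
TARGET_LABEL = 1        # attacker-chosen target class (e.g., positive sentiment)
POISON_RATE = 0.01      # fraction of training samples to poison

def insert_trigger(text: str, trigger: str) -> str:
    """Insert the trigger sentence at a random sentence boundary,
    so the poisoned review still reads as plausible text."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    pos = random.randint(0, len(sentences))
    sentences.insert(pos, trigger)
    return ". ".join(sentences) + "."

def poison_dataset(texts, labels, rate=POISON_RATE):
    """Return a copy of the dataset in which a small fraction of samples
    carry the trigger and are relabeled to the target class."""
    texts, labels = list(texts), list(labels)
    n_poison = int(len(texts) * rate)
    for i in random.sample(range(len(texts)), n_poison):
        texts[i] = insert_trigger(texts[i], TRIGGER)
        labels[i] = TARGET_LABEL
    return texts, labels
```

Training the classifier on the output of `poison_dataset` leaves clean-input accuracy largely unchanged, while any test review containing the trigger sentence is pushed toward the target class.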

RELATED WORK
BACKGROUND
LONG SHORT-TERM MEMORY NETWORKS
EXPERIMENT EVALUATION
Findings
CONCLUSION
