A Review of Adversarial Attacks and Defense Techniques in Text Processing Models

Ruihan Wang

doi:10.54254/2755-2721/96/20241306

Abstract

Abstract. With the rise of neural networks, the need for accuracy, robustness, and security has increased. Research has shown that small, carefully crafted perturbations, known as adversarial examples, can deceive models and lead to incorrect predictions. Current research focuses on the image domain, while there is a notable lack of exploration in the text domain, due to its discrete nature. This paper reviews adversarial attack techniques and defense strategies in text-based neural network models, aiming to improve the security and resilience of these models in practical applications. Adversarial examples, which can deceive models with small perturbations, expose vulnerabilities in their robustness and security. Techniques such as TextFooler focus on synonym replacement for generating adversarial examples, while Text Random Smooth (Text-RS) enhances defense through adaptive noise strategies. The research of search space aims to explore the feature of that, proposing search space for Imperceptibility (SSIP) and Search Space for Effectiveness (SSET) to estimate the different attack methods. Furthermore, the Chinese Variation Graph Integration (CHANGE) method improves the resilience of Chinese language models by leveraging variation graphs. These advancements highlight the importance of developing effective generation and defense mechanisms for adversarial examples in text processing models. Future research should enhance adversarial example techniques, explore efficient defense strategies, and investigate transferability to improve the security and robustness of text processing models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Review of Adversarial Attacks and Defense Techniques in Text Processing Models

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Engineering

Lead the way for us

Journal: Applied and Computational Engineering	Publication Date: Nov 26, 2024
License type: cc-by

Similar Papers

Generating watermarked adversarial texts
Mingjie Li ... Hanzhou Wu
Journal of Electronic Imaging | VOL. 32
Mingjie Li, et. al.Mingjie Li ... Hanzhou Wu
28 Mar 2023
Journal of Electronic Imaging | VOL. 32

A Survey on Adversarial Examples in Deep Learning
Kai Chen ... Leiming Yan
Journal on Big Data | VOL. 2
Kai Chen, et. al.Kai Chen ... Leiming Yan
01 Jan 2020
Journal on Big Data | VOL. 2

Towards Robust Ensemble Defense Against Adversarial Examples Attack
Nag Mani ... Melody Moh
-
Nag Mani, et. al.Nag Mani ... Melody Moh
01 Dec 2019
01 Dec 2019

Generating traceable adversarial text examples by watermarking in the semantic space
Mingjie Li ... Hanzhou Wu
Journal of Electronic Imaging | VOL. 31
Mingjie Li, et. al.Mingjie Li ... Hanzhou Wu
26 Nov 2022
Journal of Electronic Imaging | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Review of Adversarial Attacks and Defense Techniques in Text Processing Models

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Engineering