Defense against adversarial attacks via textual embeddings based on semantic associative field

Jiacheng Huang, Long Chen

doi:10.1007/s00521-023-08946-7

Abstract

Deep neural networks are known to be vulnerable to various types of adversarial attacks, especially word-level attacks, in the field of natural language processing. In recent years, various defense methods have been proposed against word-level attacks; however, most of them focus only on synonym-substitution attacks, even though word-level attacks are not limited to synonym substitution. In this paper, we propose a textual adversarial defense method against word-level adversarial attacks via textual embeddings based on the semantic associative field. More specifically, we analyze why humans can read and understand textual adversarial examples and observe two crucial points: (1) there must be a relation between the original word and the perturbed word or token; (2) this relation enables humans to infer the original words, because humans have the ability to make associations. Motivated by this, we introduce the concept of the semantic associative field and propose a new defense method that builds a robust word embedding: we calculate each word vector by applying the influence of related word vectors to it, using a potential function and weighted embedding sampling to simulate the semantic influence between words in the same semantic field. We conduct comprehensive experiments and demonstrate that models using the proposed method achieve higher accuracy than the baseline defense methods, both under various adversarial attacks and on the original test sets. Moreover, the proposed method is more universal, as it is independent of model structure and does not affect training efficiency.
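
The abstract does not specify the form of the potential function or the sampling scheme, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes a Gaussian-style potential that decays with embedding distance, a hypothetical `field` dictionary mapping each word to its related words, and a simple sampling step that draws related words in proportion to their potential. All function and parameter names here are illustrative.

```python
import math
import random


def potential(distance, sigma=1.0):
    # Assumed Gaussian-style potential: influence decays with distance.
    # The paper's actual potential function is not given in the abstract.
    return math.exp(-(distance ** 2) / (2 * sigma ** 2))


def euclidean(u, v):
    # Plain Euclidean distance between two equal-length vectors.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def field_embedding(word, embeddings, field, sigma=1.0, k=None, rng=None):
    """Blend a word's vector with vectors of related words.

    embeddings: dict mapping word -> list of floats
    field:      hypothetical dict mapping word -> list of related words
    k:          if None, return the potential-weighted average;
                otherwise sample k related words (with replacement),
                weighted by potential, and average them with the base vector.
    """
    base = embeddings[word]
    related = [w for w in field.get(word, []) if w in embeddings]
    if not related:
        return list(base)

    weights = [potential(euclidean(base, embeddings[w]), sigma) for w in related]

    if k is None:
        # Deterministic mode: the word itself has weight potential(0) = 1.
        vectors = [base] + [embeddings[w] for w in related]
        coeffs = [1.0] + weights
    else:
        # Sampling mode: closer words are drawn more often, then all
        # drawn vectors count equally alongside the base vector.
        rng = rng or random.Random()
        drawn = rng.choices(related, weights=weights, k=k)
        vectors = [base] + [embeddings[w] for w in drawn]
        coeffs = [1.0] * len(vectors)

    total = sum(coeffs)
    return [
        sum(c * vec[i] for c, vec in zip(coeffs, vectors)) / total
        for i in range(len(base))
    ]
```

In this sketch, a perturbed token that sits in the same assumed field as the original word ends up with a vector pulled toward its neighbors, which is one way to read the "semantic influence between words" described above. How the fields are built and how the resulting embedding is used during training are not covered here.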

Journal: Neural Computing and Applications
Publication Date: Nov 2, 2023
License type: CC BY 4.0

