Adversarial Defense Research Articles

Previous works have shown that automatic speaker verification (ASV) is seriously vulnerable to malicious spoofing attacks, such as replay, synthetic speech, and recently emerged adversarial attacks. Great efforts have been dedicated to defending ASV against replay and synthetic speech; however, only a few approaches have been explored to deal with adversarial attacks. All the existing approaches to tackle adversarial attacks for ASV require the knowledge for adversarial samples generation, but it is impractical for defenders to know the exact attack algorithms that are applied by the in-the-wild attackers. This work is among the first to perform adversarial defense for ASV without knowing the specific attack algorithms. Inspired by self-supervised learning models (SSLMs) that possess the merits of alleviating the superficial noise in the inputs and reconstructing clean samples from the interrupted ones, this work regards adversarial perturbations as one kind of noise and conducts adversarial defense for ASV by SSLMs. Specifically, we propose to perform adversarial defense from two perspectives: 1) adversarial perturbation purification and 2) adversarial perturbation detection. The purification module aims at alleviating the adversarial perturbations in the samples and pulling the contaminated adversarial inputs back towards the decision boundary. Experimental results show that our proposed purification module effectively counters adversarial attacks and outperforms traditional filters from both alleviating the adversarial noise and maintaining the performance of genuine samples. The detection module aims at detecting adversarial samples from genuine ones based on the statistical properties of ASV scores derived by a unique ASV integrating with different number of SSLMs. Experimental results show that our detection module helps shield the ASV by detecting adversarial samples. Both purification and detection methods are helpful for defending against different kinds of attack algorithms. Moreover, since there is no common metric for evaluating the ASV performance under adversarial attacks, this work also formalizes evaluation metrics for adversarial defense considering both purification and detection based approaches into account. We sincerely encourage future works to benchmark their approaches based on the proposed evaluation framework.

Although pre-trained language models (PrLMs) have achieved significant success, recent studies demonstrate that PrLMs are vulnerable to adversarial attacks. By generating adversarial examples with slight perturbations on different levels (sentence / word / character), adversarial attacks can fool PrLMs to generate incorrect predictions, which questions the robustness of PrLMs. However, we find that most existing textual adversarial examples are unnatural, which can be easily distinguished by both human and machine. Based on a general anomaly detector, we propose a novel metric (Degree of Anomaly) as a constraint to enable current adversarial attack approaches to generate more natural and imperceptible adversarial examples. Under this new constraint, the success rate of existing attacks drastically decreases, which reveals that the robustness of PrLMs is not as fragile as they claimed. In addition, we find that four types of randomization can invalidate a large portion of textual adversarial examples. Based on anomaly detector and randomization, we design a universal defense framework, which is among the first to perform textual adversarial defense without knowing the specific attack. Empirical results show that our universal defense framework achieves comparable or even higher after-attack accuracy with other specific defenses, while preserving higher original accuracy at the same time. Our work discloses the essence of textual adversarial attacks, and indicates that (i) further works of adversarial attacks should focus more on how to overcome the detection and resist the randomization, otherwise their adversarial examples would be easily detected and invalidated; and (ii) compared with the unnatural and perceptible adversarial examples, it is those undetectable adversarial examples that pose real risks for PrLMs and require more attention for future robustness-enhancing strategies.

Adversarial Defense Research Articles

Related Topics

Articles published on Adversarial Defense

PAGN: perturbation adaption generation network for point cloud adversarial defense

Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

Perturbation Inactivation Based Adversarial Defense for Face Recognition

NetFense: Adversarial Defenses against Privacy Attacks on Neural Networks for Graph Data

Sinkhorn Adversarial Attack and Defense.

Defensive Few-Shot Learning.

Towards a Constructive Framework for Control Theory

Are Deep Neural Networks Robust to Named Entities? An Adversarial Attack and Defense Perspective

Stylized Adversarial Defense.

VANET Jamming and Adversarial Attack Defense for Autonomous Vehicle Safety

Rethinking Textual Adversarial Defense for Pre-Trained Language Models

RAILS: A Robust Adversarial Immune-Inspired Learning System

Adversarial example defense based on image reconstruction.

Semi-supervised robust training with generalized perturbed neighborhood

Preprocessing Pipelines including Block-Matching Convolutional Neural Network for Image Denoising to Robustify Deep Reidentification against Evasion Attacks.

Enhancing adversarial defense for medical image analysis systems with pruning and attention mechanism.

Advanced Generative Adversarial Defense Model using Instance Sparsity Mapping Network

Adversarial Robustness of Deep Convolutional Neural Network-based Image Recognition Models: A Review

Text Adversarial Examples Generation and Defense Based on Reinforcement Learning

Attack-less adversarial training for a robust adversarial defense

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Adversarial Defense Research Articles

Related Topics

Articles published on Adversarial Defense

PAGN: perturbation adaption generation network for point cloud adversarial defense

Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

Perturbation Inactivation Based Adversarial Defense for Face Recognition

NetFense: Adversarial Defenses against Privacy Attacks on Neural Networks for Graph Data

Sinkhorn Adversarial Attack and Defense.

Defensive Few-Shot Learning.

Towards a Constructive Framework for Control Theory

Are Deep Neural Networks Robust to Named Entities? An Adversarial Attack and Defense Perspective

Stylized Adversarial Defense.

VANET Jamming and Adversarial Attack Defense for Autonomous Vehicle Safety

Rethinking Textual Adversarial Defense for Pre-Trained Language Models

RAILS: A Robust Adversarial Immune-Inspired Learning System

Adversarial example defense based on image reconstruction.

Semi-supervised robust training with generalized perturbed neighborhood

Preprocessing Pipelines including Block-Matching Convolutional Neural Network for Image Denoising to Robustify Deep Reidentification against Evasion Attacks.

Enhancing adversarial defense for medical image analysis systems with pruning and attention mechanism.

Advanced Generative Adversarial Defense Model using Instance Sparsity Mapping Network

Adversarial Robustness of Deep Convolutional Neural Network-based Image Recognition Models: A Review

Text Adversarial Examples Generation and Defense Based on Reinforcement Learning

Attack-less adversarial training for a robust adversarial defense