Vulnerability Of Deep Neural Networks Research Articles

Vulnerability of deep neural networks to small adversarial examples has recently attracted a lot of attention. As a result, making models robust to small adversarial noises has been sought in many safety critical applications. Adversarial training through iterative projected gradient descent (PGD) has been established as one of the mainstream ideas to achieve this goal. However, PGD is computationally demanding and often prohibitive in the case of large datasets and models. For this reason, the single-step PGD, also known as the Fast Gradient Sign Method (FGSM), has recently gained interest in the field. Unfortunately, FGSM-training leads to a phenomenon called “catastrophic overfitting,” which is a sudden drop in the test adversarial accuracy under the PGD attack. In this paper, we propose new methods to prevent this failure mode of the FGSM-based attacks with almost no extra computational cost. The proposed methods are also backed up with theoretical insights into the causes of the catastrophic overfitting. Our intuition is that small input gradients play a key role in this phenomenon. The signs of such gradients are quite unstable and fragile from an epoch to the next, making the signed gradient method discontinuous along the training process. These instabilities introduce large weight updates by the stochastic gradient descent, and hence potentially cause overfitting. To mitigate this issue, we propose to simply identify such gradients and make them zero prior to taking the sign in the FGSM attack calculation that is used in the training. This remedy makes the training perturbations stable, while almost preserving the adversarial property of such perturbations. The idea while being simple and efficient, achieves competitive adversarial accuracy on various datasets and can be used as an affordable method to train robust deep neural networks2.

Read full abstract

Adversarial robustness has attracted extensive studies in various fields by increasing the interpretability of deep learning and enhancing the understanding of neural network models. In realistic scenarios such as UAV control system, imbalanced datasets are a consensus. Therefore, how to solve the adversarial robustness on imbalanced datasets is a more and more inescapable problem in UAV control system. There have been some works on adversarial robustness on imbalanced datasets, which bring us a deeper understanding of vulnerability of deep neural networks and the generation of adversarial examples. To adjust the classification plane after training, the long-tailed robustness framework is usually designed to be multi-stage, and different classifiers are used in different stages, which can improve the robustness through multi learning. The existing methods are hardly considered to effectively handle the long-tailed robustness problem in UAV control system. To explore the intrinsic features of long-tailed robustness, we propose a one-stage robustness framework. First, we study different classifiers and propose a general cosine classifier. By changing the general cosine classifier adaptively, the model obtains a more robust classification. Then, we analyze the scalability of the focal loss and design a focal-margin loss. Finally, we design a category focus mobile learning strategy, obtained more robust features by changing the learning emphasis with this strategy. From this, we design a simple and efficient one-stage dynamic adversarial robustness method DRL under long-tailed distribution, which consists of an adaptive cosine classifier and a focal-margin loss under long-tailed mobile learning. The extended experiments demonstrate the superiority of our approach over other state-of-the-art methods, and the effectiveness of the designed module. This method can effectively solve the long-tailed robustness problem on UAV control system and other terminals.

Read full abstract

Vulnerability Of Deep Neural Networks Research Articles

Related Topics

Articles published on Vulnerability Of Deep Neural Networks

Perturbation Augmentation for Adversarial Training with Diverse Attacks

Adversarial Danger Identification on Temporally Dynamic Graphs.

Adversarial attacks on GAN-based image fusion

RETRACTED: Integrating confidence calibration and adversarial robustness via adversarial calibration entropy

Image-Level Adaptive Adversarial Ranking for Person Re-Identification.

Federated Adversarial Domain Hallucination for Privacy-Preserving Domain Generalization

A survey on vulnerability of deep neural networks to adversarial examples and defense approaches to deal with them

Masking and purifying inputs for blocking textual adversarial attacks

QuanDA: GPU Accelerated Quantitative Deep Neural Network Analysis

Query-Efficient Generation of Adversarial Examples for Defensive DNNs via Multiobjective Optimization

ZeroGrad: Costless conscious remedies for catastrophic overfitting in the FGSM adversarial training

Sparse fooling images: Fooling machine perception through unrecognizable images

DRL: Dynamic rebalance learning for adversarial robustness of UAV with long-tailed distribution

Sensitive region-aware black-box adversarial attacks

Efficient Query-based Black-box Attack against Cross-modal Hashing Retrieval

Adversarial attack method against image classification based on haze perturbation

Adversarial Examples Protect Your Privacy on Speech Enhancement System

Robustness Learning via Inference-Softmax Cross Entropy in Misaligned Distribution of Image

Visually imperceptible adversarial patch attacks

Textual adversarial attacks by exchanging text‐self words

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Vulnerability Of Deep Neural Networks Research Articles

Related Topics

Articles published on Vulnerability Of Deep Neural Networks

Perturbation Augmentation for Adversarial Training with Diverse Attacks

Adversarial Danger Identification on Temporally Dynamic Graphs.

Adversarial attacks on GAN-based image fusion

RETRACTED: Integrating confidence calibration and adversarial robustness via adversarial calibration entropy

Image-Level Adaptive Adversarial Ranking for Person Re-Identification.

Federated Adversarial Domain Hallucination for Privacy-Preserving Domain Generalization

A survey on vulnerability of deep neural networks to adversarial examples and defense approaches to deal with them

Masking and purifying inputs for blocking textual adversarial attacks

QuanDA: GPU Accelerated Quantitative Deep Neural Network Analysis

Query-Efficient Generation of Adversarial Examples for Defensive DNNs via Multiobjective Optimization

ZeroGrad: Costless conscious remedies for catastrophic overfitting in the FGSM adversarial training

Sparse fooling images: Fooling machine perception through unrecognizable images

DRL: Dynamic rebalance learning for adversarial robustness of UAV with long-tailed distribution

Sensitive region-aware black-box adversarial attacks

Efficient Query-based Black-box Attack against Cross-modal Hashing Retrieval

Adversarial attack method against image classification based on haze perturbation

Adversarial Examples Protect Your Privacy on Speech Enhancement System

Robustness Learning via Inference-Softmax Cross Entropy in Misaligned Distribution of Image

Visually imperceptible adversarial patch attacks

Textual adversarial attacks by exchanging text‐self words