White-box Attacks Research Articles

Deep learning based on labeled data has brought massive success in computer vision, speech recognition, and natural language processing. Nevertheless, labeled data is just a drop in the ocean compared with unlabeled data. How can people utilize the unlabeled data effectively? Research has focused on unsupervised and semi-supervised learning to solve such a problem. Some theoretical and empirical studies have proved that unlabeled data can help boost the generalization ability and robustness under adversarial attacks. However, current theoretical research on the relationship between robustness and unlabeled data limits its scope to toy datasets. Meanwhile, the visual models in autonomous driving need a significant improvement in robustness to guarantee security and safety. This paper proposes a semi-supervised learning framework for object detection in autonomous vehicles, improving the robustness with unlabeled data. Firstly, we build a baseline with the transfer learning of an unsupervised contrastive learning method—Momentum Contrast (MoCo). Secondly, we propose a semi-supervised co-training method to label the unlabeled data for retraining, which improves generalization on the autonomous driving dataset. Thirdly, we apply the unsupervised Bounding Box data augmentation (BBAug) method based on a search algorithm, which uses reinforcement learning to improve the robustness of object detection for autonomous driving. We present an empirical study on the KITTI dataset with diverse adversarial attack methods. Our proposed method realizes the state-of-the-art generalization and robustness under white-box attacks (DPatch and Contextual Patch) and black-box attacks (Gaussian noise, Rain, Fog, and so on). Our proposed method and empirical study show that using more unlabeled data benefits the robustness of perception systems in autonomous driving.

Read full abstract

Deep neural networks (DNNs) play key roles in various artificial intelligence applications such as image classification and object recognition. However, a growing number of studies have shown that there exist adversarial examples in DNNs, which are almost imperceptibly different from the original samples but can greatly change the output of DNNs. Recently, many white-box attack algorithms have been proposed, and most of the algorithms concentrate on how to make the best use of gradients per iteration to improve adversarial performance. In this article, we focus on the properties of the widely used activation function, rectified linear unit (ReLU), and find that there exist two phenomena (i.e., wrong blocking and over transmission) misguiding the calculation of gradients for ReLU during backpropagation. Both issues enlarge the difference between the predicted changes of the loss function from gradients and corresponding actual changes and misguide the optimized direction, which results in larger perturbations. Therefore, we propose a universal gradient correction adversarial example generation method, called ADV-ReLU, to enhance the performance of gradient-based white-box attack algorithms such as fast gradient signed method (FGSM), iterative FGSM (I-FGSM), momentum I-FGSM (MI-FGSM), and variance tuning MI-FGSM (VMI-FGSM). Through backpropagation, our approach calculates the gradient of the loss function with respect to the network input, maps the values to scores, and selects a part of them to update the misguided gradients. Comprehensive experimental results on ImageNet and CIFAR10 demonstrate that our ADV-ReLU can be easily integrated into many state-of-the-art gradient-based white-box attack algorithms, as well as transferred to black-box attacks, to further decrease perturbations measured in the l2 -norm.

Read full abstract

White-box Attacks Research Articles

Related Topics

Articles published on White-box Attacks

Attacking Transformers with Feature Diversity Adversarial Perturbation

TTTS: Tree Test Time Simulation for Enhancing Decision Tree Robustness against Adversarial Examples

Improving adversarial transferability through frequency enhanced momentum

Enhancing Generalization in Few-Shot Learning for Detecting Unknown Adversarial Examples

Adversarial Attacks with Defense Mechanisms on Convolutional Neural Networks and Recurrent Neural Networks for Malware Classification

Adversarial Robustness with Partial Isometry.

Multi-scale architectures matter: Examining the adversarial robustness of flow-based lossless compression

Robust object detection for autonomous driving based on semi-supervised learning

SecureSense: Defending Adversarial Attack for Secure Device-Free Human Activity Recognition

Gradient Correction for White-Box Adversarial Attacks.

LAFIT: Efficient and Reliable Evaluation of Adversarial Defenses With Latent Features.

Practical Feature Inference Attack in Vertical Federated Learning During Prediction in Artificial Internet of Things

AD 2 VNCS: A dversarial D efense and D evice V ariation-tolerance in Memristive Crossbar-based N euromorphic C omputing S ystems

AdaNI: Adaptive Noise Injection to improve adversarial robustness

Adv-BDPM: Adversarial attack based on Boundary Diffusion Probability Model

Adversarial image generation by spatial transformation in perceptual colorspaces

On the Privacy of Counting Bloom Filters Under a Black-Box Attacker

CANARY: An Adversarial Robustness Evaluation Platform for Deep Learning Models on Image Classification

LWED: Lightweight white-box encryption communication system for drones over CARX algorithm

Towards evaluating robustness of violence detection in videos using cross-domain transferability

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

White-box Attacks Research Articles

Related Topics

Articles published on White-box Attacks

Attacking Transformers with Feature Diversity Adversarial Perturbation

TTTS: Tree Test Time Simulation for Enhancing Decision Tree Robustness against Adversarial Examples

Improving adversarial transferability through frequency enhanced momentum

Enhancing Generalization in Few-Shot Learning for Detecting Unknown Adversarial Examples

Adversarial Attacks with Defense Mechanisms on Convolutional Neural Networks and Recurrent Neural Networks for Malware Classification

Adversarial Robustness with Partial Isometry.

Multi-scale architectures matter: Examining the adversarial robustness of flow-based lossless compression

Robust object detection for autonomous driving based on semi-supervised learning

SecureSense: Defending Adversarial Attack for Secure Device-Free Human Activity Recognition

Gradient Correction for White-Box Adversarial Attacks.

LAFIT: Efficient and Reliable Evaluation of Adversarial Defenses With Latent Features.

Practical Feature Inference Attack in Vertical Federated Learning During Prediction in Artificial Internet of Things

AD 2 VNCS: A dversarial D efense and D evice V ariation-tolerance in Memristive Crossbar-based N euromorphic C omputing S ystems

AdaNI: Adaptive Noise Injection to improve adversarial robustness

Adv-BDPM: Adversarial attack based on Boundary Diffusion Probability Model

Adversarial image generation by spatial transformation in perceptual colorspaces

On the Privacy of Counting Bloom Filters Under a Black-Box Attacker

CANARY: An Adversarial Robustness Evaluation Platform for Deep Learning Models on Image Classification

LWED: Lightweight white-box encryption communication system for drones over CARX algorithm

Towards evaluating robustness of violence detection in videos using cross-domain transferability