Abstract

Deep neural networks are vulnerable to adversarial attacks. Most L0-norm based white-box attacks craft perturbations by the gradient of models to the input. Since the computation cost and memory limitation of the Hessian matrix, the application of Hessian or approximate Hessian in white-box attacks is gradually shelved. In this work, we note that the sparsity requirement on perturbations naturally lends itself to the usage of Hessian information. We study the attack performance and computation cost of the attack method based on Hessian information with a limited number of perturbation pixels. Specifically, we propose the Limited Pixel BFGS (LP-BFGS) attack method by incorporating the perturbation pixel selection strategy and the BFGS algorithm. Pixels with top-k attribution scores calculated by the Integrated Gradient method are regarded as optimization variables of the LP-BFGS attack by the position map matrix. Experimental results across different networks and datasets demonstrate that our approach has promising attack ability with reasonable computation in different numbers of perturbation pixels compared with existing solutions.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call