Abstract

Black-box attacks against deep neural network (DNN) classifiers are receiving increasing attention because they represent a more practical real-world approach than white-box attacks. In black-box environments, adversaries have limited knowledge of the target model. This makes it difficult to estimate gradients for crafting adversarial examples, so powerful white-box algorithms cannot be applied directly to black-box attacks. A well-known black-box attack strategy therefore creates local DNNs, called substitute models, to emulate the target model. The adversaries then craft adversarial examples using the substitute models instead of the unknown target model. The substitute models are trained iteratively by querying the target model and observing the labels it returns. However, emulating a target model usually requires numerous queries because the new DNNs are trained from scratch. In this study, we propose a new training method for substitute models that minimizes the number of queries. We consider the number of queries an important factor in practical black-box attacks because real-world systems often restrict queries for security and financial reasons. To decrease the number of queries, the proposed method does not emulate the entire target model; it adjusts only the part of the classification boundary relevant to the current attack. Furthermore, it uses no queries in the pre-training phase and issues queries only in the retraining phase. The experimental results indicate that the proposed method is effective in terms of the number of queries and the attack success ratio against MNIST, VGGFace2, and ImageNet classifiers in query-limited black-box environments. We also demonstrate a black-box attack against a commercial classifier, Google AutoML Vision.
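The substitute-model strategy described above can be sketched in a few lines. The toy `target_model` below is a hypothetical stand-in for the unknown black-box classifier (the attacker observes only its output labels), and the substitute is a simple logistic-regression model rather than a DNN; both are illustrative assumptions, not the paper's actual setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical black-box target: the attacker can only observe its
# output labels, never its parameters or gradients.
def target_model(x):
    return (x.sum(axis=1) > 0).astype(int)

# Step 1: spend label queries on synthetic inputs to collect training data.
X = rng.normal(size=(200, 5))
y = target_model(X)                  # 200 queries to the target
queries_used = len(X)

# Step 2: train a logistic-regression substitute on the queried labels.
w, b = np.zeros(5), 0.0
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(X @ w + b)))   # substitute's P(class 1)
    w -= 0.5 * (X.T @ (p - y)) / len(X)       # gradient step on log loss
    b -= 0.5 * (p - y).mean()

# Step 3: craft an FGSM-style adversarial example on the substitute
# (where gradients are available) and transfer it to the target.
x0 = X[y == 1][0]                    # a point the target labels class 1
x_adv = x0 - 2.0 * np.sign(w)        # perturb against the substitute gradient
```

Because the substitute approximates the target's decision boundary, the example crafted with white-box access to the substitute tends to transfer; the paper's contribution is reducing the queries spent in steps 1-2.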

Highlights

  • Deep neural network (DNN) classifiers have made significant progress in many domains such as image classification [1,2], voice recognition [3,4], malware detection [5,6], and natural language processing [7].

  • This study proposes a new method for training the substitute model with the purpose of decreasing the number of queries.

  • Because real-world services and systems often limit queries, minimizing the number of queries is an important factor in practical black-box attacks.


Introduction

Deep neural network (DNN) classifiers have made significant progress in many domains such as image classification [1,2], voice recognition [3,4], malware detection [5,6], and natural language processing [7]. Despite their great success, recent studies have demonstrated that DNNs are vulnerable to well-designed input samples called adversarial examples [8,9]. Manipulated traffic signs can confuse autonomous vehicles [10,11], and adversarial voices can deceive automatic voice recognition models [12,13] such as Apple’s Siri and Amazon’s Alexa.
