Generation of Black-box Audio Adversarial Examples Based on Gradient Approximation and Autoencoders

Po-Hao Huang, Max Panoff, Honggang Yu, Ting-Chi Wang

doi:10.1145/3491220

Abstract

Deep Neural Networks (DNNs) are gaining popularity thanks to their ability to attain high accuracy and performance in various security-critical scenarios. However, recent research shows that DNN-based Automatic Speech Recognition (ASR) systems are vulnerable to adversarial attacks. Most of these attacks formulate adversarial example generation as an iterative, optimization-based process. Although these attacks have made significant progress, they still take a long time to produce adversarial examples, which makes them difficult to launch in real-world scenarios. In this article, we propose a real-time attack framework that uses a neural network trained with a gradient approximation method to generate adversarial examples against Keyword Spotting (KWS) systems. The experimental results show that the generated adversarial examples can easily fool a black-box KWS system into outputting incorrect results with only one inference. Compared with previous works, our attack achieves a higher success rate in less than 0.004 s. We also extend our work by presenting a novel ensemble audio adversarial attack and by testing the attack on KWS systems equipped with existing defense mechanisms. The efficacy of the proposed attack is well supported by promising experimental results.
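The abstract does not say which gradient approximation method is used, so the following is only a minimal sketch of the general idea: estimating a gradient from loss queries alone, as a black-box setting requires. It assumes a generic random-direction finite-difference estimator, and `toy_loss`, `estimate_gradient`, and `attack` are hypothetical names, with a simple quadratic standing in for the loss of a real KWS system. It does not reproduce the authors' trained generator network.

```python
import random


def estimate_gradient(loss_fn, x, num_samples=200, sigma=1e-3, rng=None):
    """Estimate the gradient of loss_fn at x using only loss queries.

    Assumption: a generic random-direction symmetric finite-difference
    estimator, not necessarily the method used in the paper.
    """
    rng = rng or random.Random(0)
    grad = [0.0] * len(x)
    for _ in range(num_samples):
        # Draw a random Gaussian probing direction.
        u = [rng.gauss(0.0, 1.0) for _ in x]
        x_plus = [xi + sigma * ui for xi, ui in zip(x, u)]
        x_minus = [xi - sigma * ui for xi, ui in zip(x, u)]
        # Directional derivative along u, from two black-box queries.
        slope = (loss_fn(x_plus) - loss_fn(x_minus)) / (2.0 * sigma)
        for i, ui in enumerate(u):
            grad[i] += slope * ui / num_samples
    return grad


def toy_loss(x, target=(0.5, -0.25, 0.0, 0.75)):
    """Hypothetical stand-in for a black-box loss: squared distance to target."""
    return sum((xi - ti) ** 2 for xi, ti in zip(x, target))


def attack(x, steps=20, lr=0.1, seed=0):
    """Iteratively lower toy_loss using estimated gradients only."""
    rng = random.Random(seed)
    for _ in range(steps):
        grad = estimate_gradient(toy_loss, x, rng=rng)
        x = [xi - lr * gi for xi, gi in zip(x, grad)]
    return x
```

Each call to `estimate_gradient` here costs 400 loss queries, and `attack` repeats it for 20 steps. This is the kind of per-example iterative cost that the abstract contrasts with generating an adversarial example in a single inference.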

Journal: ACM Journal on Emerging Technologies in Computing Systems
Publication Date: Jul 31, 2022
Citations: 1

Similar Papers

Audio Adversarial Examples Generation with Recurrent Neural Networks
Kuei-Huan Chang ... Po-Hao Huang
01 Jan 2020

Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random Noise
Mingyu Dong ... Diqun Yan
Journal of the Audio Engineering Society | VOL. 71
16 Jan 2023

Developing STT and KWS systems using limited language resources
Viet-Bac Le ... Jean-Luc Gauvain
14 Sep 2014

Different confidence measures for word verification in speech recognition
M.C. Benítez ... A. de la Torre
Speech Communication | VOL. 32
14 Aug 2000
