Abstract
In recent years, significant progress has been made in testing methods for deep neural networks (DNNs) to ensure their correctness and robustness. Coverage-guided criteria, such as neuron-wise, layer-wise, and path-/trace-wise coverage, have been proposed for DNN fuzzing. However, existing coverage-based criteria encounter performance bottlenecks for several reasons: ❶ Testing Adequacy: some neural coverage criteria have been observed to reach full coverage with only a small number of test inputs; once coverage saturates, increasing the number of test inputs does not consistently improve the quality of models. ❷ Interpretability: current coverage criteria lack interpretability, so testers cannot identify or understand which incorrect attributes or patterns of the model are triggered by the test inputs, which hampers the subsequent debugging and fixing process. There is therefore an urgent need for a novel fuzzing criterion that offers improved testing adequacy, better interpretability, and more effective failure detection for DNNs. To alleviate these limitations, we propose NSGen, an approach for DNN fuzzing that uses neuron semantics as guidance during test generation. NSGen identifies critical neurons, translates their high-level semantic features into natural language descriptions, and assembles them into human-readable DNN decision paths (representing the internal decisions of the DNN). With these decision paths, NSGen generates more fault-revealing test inputs by quantifying the similarity between original and mutated test inputs during fuzzing. We evaluate NSGen on popular DNN models (VGG16_BN, ResNet50, and MobileNet_v2) using the CIFAR10, CIFAR100, Oxford 102 Flower, and ImageNet datasets. Compared with 12 existing coverage-guided fuzzing criteria, NSGen outperforms all baselines, triggering 21.4% to 61.2% more faults than the state-of-the-art coverage-guided criterion. This demonstrates NSGen's effectiveness in generating fault-revealing test inputs through guided input mutation, highlighting its potential to enhance DNN testing and interpretability.
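To make the similarity-guided mutation idea concrete, the following minimal Python/PyTorch sketch illustrates one way a decision-path comparison between an original input and its mutant could steer fuzzing. It is an illustration under stated assumptions, not NSGen's actual implementation: the model, the chosen layers, the hand-picked "critical" neuron indices, and the cosine-similarity score are all hypothetical placeholders; in NSGen the critical neurons carry natural-language semantic descriptions, which this sketch omits.

# Hypothetical sketch: decision-path similarity as fuzzing guidance.
# The layer names and critical-neuron indices below are illustrative
# assumptions, not NSGen's published selection procedure.
import torch
import torch.nn.functional as F
import torchvision.models as models

model = models.resnet50(weights=None).eval()

# Assume a set of "critical" neurons (layer name -> channel indices)
# has been selected in advance, e.g. from activations on the training set.
critical = {"layer3": [12, 87, 301], "layer4": [5, 44]}

def decision_path(x):
    """Collect mean activations of the critical neurons as one vector."""
    feats, hooks = [], []
    def make_hook(idx):
        def hook(_module, _inputs, out):
            # out: (N, C, H, W); average each selected channel spatially
            feats.append(out[:, idx].mean(dim=(2, 3)))
        return hook
    for name, idx in critical.items():
        layer = dict(model.named_modules())[name]
        hooks.append(layer.register_forward_hook(make_hook(idx)))
    with torch.no_grad():
        model(x)
    for h in hooks:
        h.remove()
    return torch.cat(feats, dim=1)  # (N, total_critical_neurons)

def fuzz_score(original, mutated):
    """Lower similarity => the mutant follows a different internal
    decision path, so it is a more promising fault-revealing candidate."""
    return F.cosine_similarity(decision_path(original), decision_path(mutated))

x = torch.rand(1, 3, 224, 224)
x_mut = torch.clamp(x + 0.05 * torch.randn_like(x), 0, 1)
print(fuzz_score(x, x_mut))

A fuzzer built on this score would retain and further mutate inputs whose decision-path similarity drops most, on the assumption that diverging internal decisions are more likely to expose faults.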