Coverage-guided Fuzzing Research Articles

In recent years, significant progress has been made in testing methods for deep neural networks (DNNs) to ensure their correctness and robustness. Coverage-guided criteria, such as neuron-wise, layer-wise, and path-/trace-wise, have been proposed for DNN fuzzing. However, existing coverage-based criteria encounter performance bottlenecks for several reasons: ❶ Testing Adequacy : Partial neural coverage criteria have been observed to achieve full coverage using only a small number of test inputs. In this case, increasing the number of test inputs does not consistently improve the quality of models. ❷ Interpretability : The current coverage criteria lack interpretability. Consequently, testers are unable to identify and understand which incorrect attributes or patterns of the model are triggered by the test inputs. This lack of interpretability hampers the subsequent debugging and fixing process. Therefore, there is an urgent need for a novel fuzzing criterion that offers improved testing adequacy, better interpretability, and more effective failure detection capabilities for DNNs. To alleviate these limitations, we propose NSGen, an approach for DNN fuzzing that utilizes neuron semantics as guidance during test generation. NSGen identifies critical neurons, translates their high-level semantic features into natural language descriptions, and then assembles them into human-readable DNN decision paths (representing the internal decision of the DNN). With these decision paths, we can generate more fault-revealing test inputs by quantifying the similarity between original test inputs and mutated test inputs for fuzzing. We evaluate NSGen on popular DNN models (VGG16_BN, ResNet50, and MobileNet_v2) using CIFAR10, CIFAR100, Oxford 102 Flower, and ImageNet datasets. Compared to 12 existing coverage-guided fuzzing criteria, NSGen outperforms all baselines, increasing the number of triggered faults by 21.4% to 61.2% compared to the state-of-the-art coverage-guided fuzzing criterion. This demonstrates NSGen’s effectiveness in generating fault-revealing test inputs through guided input mutation, highlighting its potential to enhance DNN testing and interpretability.

Read full abstract

Coverage-guided fuzzing is a widely used and effective solution to find software vulnerabilities. Tracking code coverage and utilizing it to guide fuzzing are crucial to coverage-guided fuzzers. However, tracking full and accurate path coverage is infeasible in practice due to the high instrumentation overhead. Popular fuzzers (e.g., AFL) often use <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">coarse</i> coverage information, e.g., edge hit counts stored in a compact bitmap, to achieve highly efficient greybox testing. Such inaccuracy and incompleteness in coverage introduce serious limitations to fuzzers. First, it causes <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">path collisions</i> , which prevent fuzzers from discovering potential paths that lead to new crashes. More importantly, it prevents fuzzers from making wise decisions on fuzzing strategies. In this article, we propose a coverage sensitive fuzzing solution CollAFL. It mitigates path collisions by providing more accurate coverage information, while still preserving low instrumentation overhead. It also utilizes the coverage information to apply three new fuzzing strategies, promoting the speed of discovering new paths and vulnerabilities. We implemented two variants of this solution, namely CollAFL (based on AFL) and CollAFL-bin (based on AFL-dyninst), to test applications with and without source code respectively, and evaluated them on 24 popular applications. The results showed that path collisions are common, i.e., up to 75 percent of edges could collide with others in some applications. But our solutions CollAFL and CollAFL-bin could reduce the edge collision ratio to nearly zero. Moreover, armed with the three fuzzing strategies, they outperform their counterparts (i.e., AFL and AFL-dyninst) in terms of both code coverage and vulnerability discovery. On average, CollAFL covered 20 percent more program paths, and found 320 percent more unique crashes and 260 percent more bugs than AFL in 200 hours. Moreover, CollAFL-bin covered 15 percent more paths, and found 200 percent more unique crashes and 150 percent more vulnerabilities than AFL-dyninst, showing that the proposed solution also works for binary application fuzzing. In total, CollAFL found 157 new security bugs with 95 new CVEs assigned.

Read full abstract

Coverage-guided Fuzzing Research Articles

Related Topics

Articles published on Coverage-guided Fuzzing

Reward Augmentation in Reinforcement Learning for Testing Distributed Systems

Neuron Semantic-Guided Test Generation for Deep Neural Networks Fuzzing

Testing Error Handling Code With Software Fault Injection and Error-Coverage-Guided Fuzzing

Adaptive scheduling-based fine-grained greybox fuzzing for cloud-native applications

Automated SC-MCC test case generation using coverage-guided fuzzing

JMLKelinci+: Detecting Semantic Bugs and Covering Branches with Valid Inputs Using Coverage-guided Fuzzing and Runtime Assertion Checking

A Novel Seed Generation Approach for Vulnerability Mining Based on Generative Adversarial Networks and Attention Mechanisms

Coverage-guided fuzzing for deep reinforcement learning systems

Deep Learning for Coverage-Guided Fuzzing: How Far are We?

Synergistic Verification of Hardware Peripherals through Virtual Prototype Aided Cross-Level Methodology Leveraging Coverage-Guided Fuzzing and Co-Simulation

Accelerating Fuzzing through Prefix-Guided Execution

GradFuzz: Fuzzing deep neural networks with gradient vector coverage for adversarial examples

NDFuzz: a non-intrusive coverage-guided fuzzing framework for virtualized network devices

Tardis: Coverage-Guided Embedded Operating System Fuzzing

Mixed and constrained input mutation for effective fuzzing of deep learning systems

MalFuzz: Coverage-guided fuzzing on deep learning-based malware classification model.

AflIot: Fuzzing on linux-based IoT device with binary-level instrumentation

StFuzzer: Contribution-Aware Coverage-Guided Fuzzing for Smart Devices

ReFuzz: A Remedy for Saturation in Coverage-Guided Fuzzing

Path Sensitive Fuzzing for Native Applications

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Coverage-guided Fuzzing Research Articles

Related Topics

Articles published on Coverage-guided Fuzzing

Reward Augmentation in Reinforcement Learning for Testing Distributed Systems

Neuron Semantic-Guided Test Generation for Deep Neural Networks Fuzzing

Testing Error Handling Code With Software Fault Injection and Error-Coverage-Guided Fuzzing

Adaptive scheduling-based fine-grained greybox fuzzing for cloud-native applications

Automated SC-MCC test case generation using coverage-guided fuzzing

JMLKelinci+: Detecting Semantic Bugs and Covering Branches with Valid Inputs Using Coverage-guided Fuzzing and Runtime Assertion Checking

A Novel Seed Generation Approach for Vulnerability Mining Based on Generative Adversarial Networks and Attention Mechanisms

Coverage-guided fuzzing for deep reinforcement learning systems

Deep Learning for Coverage-Guided Fuzzing: How Far are We?

Synergistic Verification of Hardware Peripherals through Virtual Prototype Aided Cross-Level Methodology Leveraging Coverage-Guided Fuzzing and Co-Simulation

Accelerating Fuzzing through Prefix-Guided Execution

GradFuzz: Fuzzing deep neural networks with gradient vector coverage for adversarial examples

NDFuzz: a non-intrusive coverage-guided fuzzing framework for virtualized network devices

Tardis: Coverage-Guided Embedded Operating System Fuzzing

Mixed and constrained input mutation for effective fuzzing of deep learning systems

MalFuzz: Coverage-guided fuzzing on deep learning-based malware classification model.

AflIot: Fuzzing on linux-based IoT device with binary-level instrumentation

StFuzzer: Contribution-Aware Coverage-Guided Fuzzing for Smart Devices

ReFuzz: A Remedy for Saturation in Coverage-Guided Fuzzing

Path Sensitive Fuzzing for Native Applications