Machine learning, and deep learning in particular, has proven effective in a wide range of application domains. Recently, several efforts have successfully applied deep learning techniques to automatic vulnerability discovery as alternatives to traditional static bug detection. In principle, these learning-based approaches are built on classification models trained with supervised learning. Depending on the granularity at which vulnerabilities are detected, the underlying models are typically trained on well-labeled source code to predict whether a program method, a program slice, or a particular code line contains a vulnerability. The effectiveness of these models is normally evaluated with conventional metrics such as precision, recall, and F1 score. In this paper, we show that despite yielding promising numbers, this evaluation strategy can be insufficient and even misleading when assessing the effectiveness of current learning-based approaches. The reason is that the underlying models only produce classification results or report individual, isolated program statements, but cannot pinpoint bug-triggering paths, which are essential for bug fixing and are the main goal of static bug detection. Our key insight is that a program method or statement can only be deemed vulnerable in the context of a bug-triggering path. In this work, we systematically study the gap between recent learning-based approaches and conventional static bug detectors using fine-grained metrics, called BTP metrics, that are defined over bug-triggering paths. We then characterize and compare the quality of the prediction results of existing learning-based detectors at different granularities. Finally, our comprehensive empirical study reveals several key issues and challenges in developing classification models that can pinpoint bug-triggering paths, and calls for more advanced learning-based bug detection techniques.