Abstraction and mining of traces to explain concurrency bugs

Mitra Tabaei Befrouei,Georg Weissenbacher,Chao Wang

doi:10.1007/s10703-015-0240-5

Mitra Tabaei Befrouei, Georg Weissenbacher + Show 1 more

Open Access

https://doi.org/10.1007/s10703-015-0240-5

Copy DOI

Journal: Formal Methods in System Design	Publication Date: Jan 4, 2016
Citations: 37	License type: CC BY 4.0

Affiliation: TU Wien, Virginia Tech

Abstract

We propose an automated mining-based method for explaining concurrency bugs. We use a data mining technique called sequential pattern mining to identify problematic sequences of concurrent read and write accesses to the shared memory of a multithreaded program. Our technique does not rely on any characteristics specific to one type of concurrency bug, thus providing a general framework for concurrency bug explanation. In our method, given a set of concurrent execution traces, we first mine sequences that frequently occur in failing traces and then rank them based on the number of their occurrences in passing traces. We consider the highly ranked sequences of events that occur frequently only in failing traces an explanation of the system failure, as they can reveal its causes in the execution traces. Since the scalability of sequential pattern mining is limited by the length of the traces, we present an abstraction technique which shortens the traces at the cost of introducing spurious explanations. Spurious as well as misleading explanations are then eliminated by a subsequent filtering step, helping the programmer to focus on likely causes of the failure. We validate our approach using a number of case studies, including synthetic as well as real-world bugs.

Highlights

While Moore’s law is still upheld by increasing the number of cores of processors, the construction of parallel programs that exploit the added computational capacity has become significantly more complicated
To detect and explain concurrency bugs, researchers have focused on a number of problematic program behaviors such as data races and atomicity/serializability violations
It does not guarantee the absence of atomicity violations, which constitute the predominant class of non-deadlock concurrency bugs [17]

Summary

Introduction

While Moore’s law is still upheld by increasing the number of cores of processors, the construction of parallel programs that exploit the added computational capacity has become significantly more complicated This holds true for debugging multithreaded shared-memory software: unexpected interactions between threads may result in erroneous and seemingly nondeterministic program behavior whose root cause is difficult to analyze. Freedom from data races, is neither a necessary nor a sufficient property to establish the correctness of a concurrent program: benign data-races include races that affect the program outcome in a manner acceptable to the programmer [6] It does not guarantee the absence of atomicity violations, which constitute the predominant class of non-deadlock concurrency bugs [17]. Automated atomicity checking depends on heuristics [36] or atomicity annotations [8] to obtain the boundaries of operations and data objects

Objectives

Methods

Findings

Conclusion