DCatch

Haopeng Liu,Guangpu Li,Chen Tian,Jeffrey F Lukman,Haryadi S Gunawi,Shan Lu,Jiaxin Li

doi:10.1145/3093336.3037735

Abstract

In big data and cloud computing era, reliability of distributed systems is extremely important. Unfortunately, distributed concurrency bugs, referred to as DCbugs, widely exist. They hide in the large state space of distributed cloud systems and manifest non-deterministically depending on the timing of distributed computation and communication. Effective techniques to detect DCbugs are desired. This paper presents a pilot solution, DCatch, in the world of DCbug detection. DCatch predicts DCbugs by analyzing correct execution of distributed systems. To build DCatch, we design a set of happens-before rules that model a wide variety of communication and concurrency mechanisms in real-world distributed cloud systems. We then build runtime tracing and trace analysis tools to effectively identify concurrent conflicting memory accesses in these systems. Finally, we design tools to help prune false positives and trigger DCbugs. We have evaluated DCatch on four representative open-source distributed cloud systems, Cassandra, Hadoop MapReduce, HBase, and ZooKeeper. By monitoring correct execution of seven workloads on these systems, DCatch reports 32 DCbugs, with 20 of them being truly harmful.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DCatch

Abstract

Talk to us

Similar Papers

More From: ACM SIGPLAN Notices

Lead the way for us

Journal: ACM SIGPLAN Notices	Publication Date: Apr 4, 2017
Citations: 4

Similar Papers

DCatch
Haopeng Liu ... Shan Lu
-
Haopeng Liu, et. al.Haopeng Liu ... Shan Lu
04 Apr 2017
04 Apr 2017

DCatch
Haopeng Liu ... Jiaxin Li
ACM SIGARCH Computer Architecture News | VOL. 45
Haopeng Liu, et. al.Haopeng Liu ... Jiaxin Li
04 Apr 2017
ACM SIGARCH Computer Architecture News | VOL. 45

Understanding and Statically Detecting Synchronization Performance Bugs in Distributed Cloud Systems
Chen Zhang ... Dongsheng Li
IEEE Access | VOL. 7
Chen Zhang, et. al.Chen Zhang ... Dongsheng Li
01 Jan 2019
IEEE Access | VOL. 7

Intellectual scaling in a distributed cloud application architecture: A message classification algorithm
Oleg Iakushkin
-
Oleg IakushkinOleg Iakushkin
01 Oct 2015
01 Oct 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DCatch

Abstract

Talk to us

Similar Papers

More From: ACM SIGPLAN Notices