Evaluating and Improving Fault Localization

Spencer Pearson, Deric Pang, Rui Abreu, Michael D Ernst, Jose Campos, Gordon Fraser, Benjamin Keller, Rene Just

doi:10.1109/icse.2017.62

Abstract

Most fault localization techniques take as input a faulty program, and produce as output a ranked list of suspicious code locations at which the program may be defective. When researchers propose a new fault localization technique, they typically evaluate it on programs with known faults. The technique is scored based on where in its output list the defective code appears. This enables the comparison of multiple fault localization techniques to determine which one is better. Previous research has evaluated fault localization techniques using artificial faults, generated either by mutation tools or manually. In other words, previous research has determined which fault localization techniques are best at finding artificial faults. However, it is not known which fault localization techniques are best at finding real faults. It is not obvious that the answer is the same, given previous work showing that artificial faults have both similarities to and differences from real faults. We performed a replication study to evaluate 10 claims in the literature that compared fault localization techniques (from the spectrum-based and mutation-based families). We used 2995 artificial faults in 6 real-world programs. Our results support 7 of the previous claims as statistically significant, but only 3 as having non-negligible effect sizes. Then, we evaluated the same 10 claims, using 310 real faults from the 6 programs. Every previous result was refuted or was statistically and practically insignificant. Our experiments show that artificial faults are not useful for predicting which fault localization techniques perform best on real faults. In light of these results, we identified a design space that includes many previously-studied fault localization techniques as well as hundreds of new techniques. We experimentally determined which factors in the design space are most important, using an overall set of 395 real faults. Then, we extended this design space with new techniques. Several of our novel techniques outperform all existing techniques, notably in terms of ranking defective code in the top-5 or top-10 reports.
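
The input/output contract and scoring described in the abstract can be illustrated with a minimal sketch of one spectrum-based technique. The abstract does not name a specific formula, so the well-known Ochiai suspiciousness formula is used here purely as an example of the spectrum-based family. The test names, statement labels, coverage data, and helper functions (`ochiai`, `rank_statements`, `rank_of`) are hypothetical, and the alphabetical tie-break is an assumption rather than anything the paper prescribes.

```python
import math


def ochiai(exec_failed, exec_passed, total_failed):
    """Ochiai suspiciousness: ef / sqrt(total_failed * (ef + ep))."""
    denom = math.sqrt(total_failed * (exec_failed + exec_passed))
    return exec_failed / denom if denom else 0.0


def rank_statements(coverage, passed):
    """Return statements ordered from most to least suspicious.

    coverage: dict mapping test name -> set of statements it executes
    passed:   dict mapping test name -> True if the test passed
    """
    total_failed = sum(1 for ok in passed.values() if not ok)
    statements = sorted(set().union(*coverage.values()))
    scores = {}
    for stmt in statements:
        ef = sum(1 for t, cov in coverage.items() if stmt in cov and not passed[t])
        ep = sum(1 for t, cov in coverage.items() if stmt in cov and passed[t])
        scores[stmt] = ochiai(ef, ep, total_failed)
    # Assumption: ties are broken alphabetically by statement label.
    return sorted(statements, key=lambda s: (-scores[s], s))


def rank_of(ranking, faulty_stmt):
    """1-based position of the known defective statement in the ranked list."""
    return ranking.index(faulty_stmt) + 1


# Hypothetical spectrum: four tests over four statements; s3 is the known fault.
coverage = {
    "t1": {"s1", "s2", "s3", "s4"},
    "t2": {"s1", "s3"},
    "t3": {"s1", "s2"},
    "t4": {"s1", "s3", "s4"},
}
passed = {"t1": True, "t2": False, "t3": True, "t4": False}

ranking = rank_statements(coverage, passed)
fault_rank = rank_of(ranking, "s3")
in_top_5 = fault_rank <= 5
```

A technique is scored by `fault_rank`: the closer the defective statement is to the top of the list, the better. A top-5 or top-10 criterion like the one mentioned in the abstract corresponds to checking whether that rank is at most 5 or 10.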
