Predicting Effectiveness of IR-Based Bug Localization Techniques

Tien-Duy B Le,David Lo,Ferdian Thung

doi:10.1109/issre.2014.39

Abstract

Recently, many information retrieval (IR) based bug localization approaches have been proposed in the literature. These approaches use information retrieval techniques to process a textual bug report and a collection of source code files to find buggy files. They output a ranked list of files sorted by their likelihood to contain the bug. Recent approaches can achieve reasonable accuracy, however, even a state-of-the-art bug localization tool outputs many ranked lists where buggy files appear very low in the lists. This potentially causes developers to distrust bug localization tools. Parnin and Orso recently conduct a user study and highlight that developers do not find an automated debugging tool useful if they do not find the root cause of a bug early in a ranked list. To address this problem, we build an oracle that can automatically predict whether a ranked list produced by an IR-based bug localization tool is likely to be effective or not. We consider a ranked list to be effective if a buggy file appears in the top-N position of the list. If a ranked list is unlikely to be effective, developers do not need to waste time in checking the recommended files one by one. In such cases, it is better for developers to use traditional debugging methods or request for further information to localize bugs. To build this oracle, our approach extracts features that can be divided into four categories: score features, textual features, topic model features, and metadata features. We build a separate prediction model for each category, and combine them to create a composite prediction model which is used as the oracle. We name our proposed approach APRILE, which stands for Automated Prediction of IR-based Bug Localization's Effectiveness. We have evaluated APRILE to predict the effectiveness of three state-of-the-art IR based bug localization tools on more than three thousands bug reports from AspectJ, Eclipse, and SWT. APRILE can achieve an average precision, recall, and F-measure of at least 70.36%, 66.94%, and 68.03%, respectively. Furthermore, APRILE outperforms a baseline approach by 84.48%, 17.74%, and 31.56% for the AspectJ, Eclipse, and SWT bug reports, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Predicting Effectiveness of IR-Based Bug Localization Techniques

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Nov 1, 2014
Citations: 59	License type: cc-by-nc-nd

Similar Papers

Will this localization tool be effective for this bug? Mitigating the impact of unreliability of information retrieval based bug localization tools
Tien-Duy B Le ... David Lo
Empirical Software Engineering | VOL. 22
Tien-Duy B Le, et. al.Tien-Duy B Le ... David Lo
08 Dec 2016
Empirical Software Engineering | VOL. 22

Combining Deep Learning with Information Retrieval to Localize Buggy Files for Bug Reports (N)
An Ngoc Lam ... Hoan Anh Nguyen
-
An Ngoc Lam, et. al.An Ngoc Lam ... Hoan Anh Nguyen
01 Nov 2015
01 Nov 2015

On the Influence of Biases in Bug Localization: Evaluation and Benchmark
Ratnadira Widyasari ... Constance Tan
-
Ratnadira Widyasari, et. al.Ratnadira Widyasari ... Constance Tan
01 Mar 2022
01 Mar 2022

Bug Localization with Combination of Deep Learning and Information Retrieval
An Ngoc Lam ... Tien N Nguyen
-
An Ngoc Lam, et. al.An Ngoc Lam ... Tien N Nguyen
01 May 2017
01 May 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predicting Effectiveness of IR-Based Bug Localization Techniques

Abstract

Talk to us

Similar Papers