Abstract
Issue tracking systems (ITSs), such as Bugzilla, are commonly used to track reported bugs, improvements, and change requests for a software project. To avoid wasting developer resources on previously-reported (i.e., duplicate) issues, it is necessary to identify such duplicates as soon as they are reported. Several automated approaches have been proposed for retrieving duplicate reports, i.e., identifying the duplicate of a new issue report in a list of $n$ candidates. These approaches leverage the textual, categorical, and contextual information in previously-reported issues to decide whether a newly-reported issue has been reported before. In general, these approaches are evaluated using data that spans a relatively short period of time (i.e., the classical evaluation). However, in this paper, we show that the classical evaluation tends to overestimate the performance of automated approaches for retrieving duplicate issue reports. Instead, we propose a realistic evaluation that uses all the reports that are available in the ITS of a software project. We conduct experiments in which we evaluate two popular approaches for retrieving duplicate issues (BM25F and REP) using both the classical and the realistic evaluation. We find that for the issue tracking data of the Mozilla Foundation, the Eclipse Foundation, and OpenOffice, the realistic evaluation shows that previously proposed approaches perform considerably worse than previously reported under the classical evaluation. As a result, we conclude that the reported performance of approaches for retrieving duplicate issue reports is significantly overestimated in the literature. To improve the performance of the automated retrieval of duplicate issue reports, we propose to leverage the resolution field of issue reports. Our experiments show that leveraging the resolution field yields a median relative performance improvement of 7-21.5 percent and a maximum improvement of 19-60 percent in the automated retrieval of duplicates.
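To make the retrieval step concrete, the following is a minimal sketch of ranking candidate duplicates with plain, single-field BM25. It is not the field-weighted BM25F variant or the REP approach evaluated in the paper, and the toy corpus, the `tokenize` and `BM25` helpers, the resolution values, and the candidate-filtering heuristic are all illustrative assumptions rather than the paper's actual use of the resolution field.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-alphanumeric characters."""
    return [t for t in re.split(r"\W+", text.lower()) if t]


class BM25:
    """Plain BM25 ranker over the text of previously-reported issues.

    Simple single-field BM25, not the field-weighted BM25F variant
    (or REP) evaluated in the paper; it only illustrates the
    "rank n candidates for a new report" retrieval step.
    """

    def __init__(self, docs, k1=1.2, b=0.75):
        self.docs = [tokenize(d) for d in docs]
        self.k1, self.b = k1, b
        self.n = len(self.docs)
        self.avgdl = sum(len(d) for d in self.docs) / self.n
        # Document frequency of each term across the corpus.
        self.df = Counter(t for d in self.docs for t in set(d))

    def _idf(self, term):
        # Smoothed BM25 idf; stays non-negative for very common terms.
        return math.log(1 + (self.n - self.df[term] + 0.5) / (self.df[term] + 0.5))

    def _score(self, query_tokens, idx):
        tf = Counter(self.docs[idx])
        dl = len(self.docs[idx])
        score = 0.0
        for t in query_tokens:
            if t not in tf:
                continue
            num = tf[t] * (self.k1 + 1)
            den = tf[t] + self.k1 * (1 - self.b + self.b * dl / self.avgdl)
            score += self._idf(t) * num / den
        return score

    def top_n(self, query, n=10):
        """Return the indices of the n highest-scoring candidates."""
        q = tokenize(query)
        ranked = sorted(range(self.n), key=lambda i: self._score(q, i), reverse=True)
        return ranked[:n]


# Toy corpus of (report text, resolution) pairs -- entirely illustrative.
corpus = [
    ("Crash when opening a large attachment", "FIXED"),
    ("Browser crashes on opening big attachments", "DUPLICATE"),
    ("Typo in the preferences dialog", "INVALID"),
]

# One plausible (assumed) use of the resolution field: drop candidates
# whose resolution (e.g., INVALID) makes them unlikely duplicate targets.
candidates = [text for text, resolution in corpus if resolution != "INVALID"]
ranker = BM25(candidates)
print(ranker.top_n("crash while opening an attachment", n=2))
```

In this sketch, filtering the candidate list by resolution before ranking is just one plausible reading of "leveraging the resolution field"; the abstract itself does not specify the mechanism.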