Improved Duplicate Bug Report Identification

Yuan Tian, David Lo, Chengnian Sun

doi:10.1109/csmr.2012.48

Abstract

Bugs are prevalent in software systems. To improve the reliability of software systems, developers often allow end users to provide feedback on the bugs they encounter. Users do this by submitting a bug report to a bug report management system such as Bugzilla. This process, however, is uncoordinated and distributed, which means that many users may submit bug reports describing the same problem. These are referred to as duplicate bug reports. The existence of many duplicate bug reports may cause much unnecessary manual effort, as a triager often needs to manually tag bug reports as duplicates. Recently, a number of studies have investigated the duplicate bug report problem; in effect, they address the following task: given a new bug report, retrieve k other similar bug reports. This, however, still requires substantial manual effort, which could be reduced further. Jalbert and Weimer were the first to introduce the direct detection of duplicate bug reports, which addresses the task: given a new bug report, classify whether or not it is a duplicate bug report. In this paper, we extend Jalbert and Weimer's work by improving the accuracy of automated duplicate bug report identification. We experiment with bug reports from the Mozilla bug tracking system that were reported between February 2005 and October 2005, and find that we can improve the accuracy of the previous approach by about 160%.
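The two problem formulations named in the abstract (retrieving the k most similar reports versus directly classifying a new report as duplicate or not) can be contrasted with a minimal sketch. It is only an illustration and not the method of this paper or of Jalbert and Weimer: the Jaccard token-overlap similarity, the 0.5 threshold, and the function names `retrieve_top_k` and `is_duplicate` are all assumptions introduced here.

```python
import re


def tokens(text):
    """Return the set of lowercase alphanumeric tokens in a report."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity between the token sets of two report texts."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def retrieve_top_k(new_report, existing, k):
    """Retrieval formulation: rank existing reports and return the top k.

    A triager still has to read the k candidates and decide.
    """
    ranked = sorted(existing, key=lambda r: jaccard(new_report, r), reverse=True)
    return ranked[:k]


def is_duplicate(new_report, existing, threshold=0.5):
    """Classification formulation: a direct yes/no answer.

    The threshold is a hypothetical value chosen for illustration only.
    """
    return any(jaccard(new_report, r) >= threshold for r in existing)


existing_reports = [
    "Browser crashes when opening PDF file",
    "Bookmark toolbar icons missing after update",
    "Crash on startup with safe mode",
]
new_report = "browser crashes when opening a PDF file"

print(retrieve_top_k(new_report, existing_reports, k=1))
print(is_duplicate(new_report, existing_reports))
```

The retrieval function always returns candidates, even for an unrelated report, whereas the classifier commits to a decision; that difference is why the classification formulation can remove more manual effort when it is accurate.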

Publication Date: Mar 1, 2012
Citations: 94
License type: cc-by-nc-nd

Similar Papers

Preventing duplicate bug reports by continuously querying bug reports
Abram Hindle ... Curtis Onuczko
Empirical Software Engineering | VOL. 24
20 Aug 2018

Towards Understanding the Impacts of Textual Dissimilarity on Duplicate Bug Report Detection
Sigma Jahan ... Mohammad Masudur Rahman
01 Mar 2023

Auto-labelling of Bug Report using Natural Language Processing
Avinash Patil ... Aryan Jadon
07 Apr 2023

Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
Ashima Kukkar ... Muhammad Bilal
IEEE Access | VOL. 8
01 Jan 2020
