Abstract

Reopened bugs can degrade the overall quality of a software system since they require unnecessary rework by developers. Moreover, reopened bugs also lead to a loss of trust among end-users regarding the quality of the software. Thus, predicting which bugs might be reopened could be extremely helpful for software developers to avoid rework. Prior studies on reopened bug prediction focus on only three open source projects (i.e., Apache, Eclipse, and OpenOffice) to generate insights. We observe that one of the three projects (i.e., Apache) has a data leak issue: the reopened bug status was included in the training data used to predict reopened bugs. In addition, prior studies used an outdated prediction model pipeline (i.e., old techniques for constructing a prediction model) to predict reopened bugs. Therefore, we revisit the reopened bug study on a large-scale dataset consisting of 47 projects tracked by JIRA, using modern techniques such as SMOTE and permutation importance together with 7 different machine learning models. We study reopened bugs using a mixed-methods approach (i.e., both quantitative and qualitative study). We find that: 1) After using an updated reopened bug prediction model pipeline, only 34% of the projects achieve acceptable performance with AUC $\geqslant$ 0.7. 2) There are four major reasons for a bug being reopened: technical (i.e., patch/integration issues), documentation, human (i.e., incorrect bug assessment), and reasons not evident in the bug reports. 3) In projects with an acceptable AUC, 94% of the reopened bugs are due to patch issues (i.e., the use of an incorrect patch) identified before bug reopening. Our study revisits reopened bugs and provides new insights into developers' bug reopening activities.
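To make the described pipeline concrete, the following is a minimal sketch, assuming Python with scikit-learn and imbalanced-learn, of how an updated reopened-bug prediction pipeline combining SMOTE oversampling, a classifier, AUC evaluation, and permutation importance might be assembled. The feature matrix `X`, labels `y`, and the choice of random forest are hypothetical placeholders for illustration, not the paper's actual dataset or implementation.

```python
# Sketch of an updated reopened-bug prediction pipeline (illustrative only).
# Assumptions: scikit-learn and imbalanced-learn are installed; X and y are
# hypothetical stand-ins for bug-report features and reopened labels.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.inspection import permutation_importance
from imblearn.over_sampling import SMOTE

rng = np.random.RandomState(42)
X = rng.rand(1000, 10)                   # placeholder feature matrix
y = (rng.rand(1000) < 0.1).astype(int)   # imbalanced labels: few reopened bugs

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42)

# Oversample the minority (reopened) class on the training split only,
# so no information from the test split leaks into training.
X_res, y_res = SMOTE(random_state=42).fit_resample(X_train, y_train)

# One of several possible classifiers; the study compares 7 models.
clf = RandomForestClassifier(n_estimators=100, random_state=42)
clf.fit(X_res, y_res)

# Evaluate with AUC; the study treats AUC >= 0.7 as acceptable.
auc = roc_auc_score(y_test, clf.predict_proba(X_test)[:, 1])
print(f"AUC: {auc:.3f}")

# Permutation importance on the held-out split to rank features.
result = permutation_importance(clf, X_test, y_test,
                                scoring="roc_auc", n_repeats=10,
                                random_state=42)
for i in result.importances_mean.argsort()[::-1]:
    print(f"feature {i}: {result.importances_mean[i]:.4f}")
```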
