SMOTE and Feature Selection for More Effective Bug Severity Prediction

Abeer Hamdy,Abdulrahman El-Laithy

doi:10.1142/s0218194019500311

Abstract

“Severity” is one of the essential features of software bug reports, which is a crucial factor for developers to decide which bug should be fixed immediately and which bug could be delayed to a next release. Severity assignment is a manual process and its accuracy depends on the experience of the assignee. Prior research proposed several models to automate this process. These models are based on textual preprocessing of historical bug reports and classification techniques. Although bug repositories suffer from severity class imbalance, none of the prior studies investigated the impact of implementing a class rebalancing technique on the accuracy of their models. In this paper, we propose a framework for predicting fine-grained severity levels which utilizes an over-sampling technique “SMOTE”, to balance the severity classes, and a feature selection scheme, to reduce the data scale and select the most informative features for training a [Formula: see text]-nearest neighbor (KNN) classifier. The KNN classifier utilizes a distance-weighted voting scheme to predict the proper severity level of a newly reported bug. We investigated the effectiveness of our proposed approach on two large bug repositories, namely Eclipse and Mozilla, and the experimental results showed that our approach outperforms cutting-edge studies in predicting the minority severity classes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SMOTE and Feature Selection for More Effective Bug Severity Prediction

Abstract

Talk to us

Similar Papers

More From: International Journal of Software Engineering and Knowledge Engineering

Lead the way for us

Journal: International Journal of Software Engineering and Knowledge Engineering	Publication Date: Jun 1, 2019
Citations: 20

Similar Papers

An Idea of setting weighting functions for feature selection
Weijie Li ... Haiqiang Chen
-
Weijie Li, et. al.Weijie Li ... Haiqiang Chen
01 Oct 2012
01 Oct 2012

KSAP: An approach to bug report assignment using KNN search and heterogeneous proximity
Wen Zhang ... Qing Wang
Information and Software Technology | VOL. 70
Wen Zhang, et. al.Wen Zhang ... Qing Wang
26 Oct 2015
Information and Software Technology | VOL. 70

Detection of coronary calcifications from computed tomography scans for automated risk assessment of coronary artery disease
Ivana Išgum ... Mathias Prokop
Medical Physics | VOL. 34
Ivana Išgum, et. al.Ivana Išgum ... Mathias Prokop
23 Mar 2007
Medical Physics | VOL. 34

Adaptive Learning-Based -Nearest Neighbor Classifiers With Resilience to Class Imbalance.
Sankha Subhra Mullick ... Shounak Datta
IEEE Transactions on Neural Networks and Learning Systems | VOL. 29
Sankha Subhra Mullick, et. al.Sankha Subhra Mullick ... Shounak Datta
27 Mar 2018
IEEE Transactions on Neural Networks and Learning Systems | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SMOTE and Feature Selection for More Effective Bug Severity Prediction

Abstract

Talk to us

Similar Papers

More From: International Journal of Software Engineering and Knowledge Engineering