Software Defect Prediction Using Non-Dominated Sorting Genetic Algorithm and $k$-Nearest Neighbour Classifier

Mohammad Azzeh, Manar Abu Talib, Ali Bou Nassif, Hajra Iqbal

doi:10.37190/e-inf240103

Abstract

Background: Software Defect Prediction (SDP) is a vital step in software development. SDP aims to identify the modules most likely to be defect-prone before the testing phase starts, which helps allocate resources and reduces the cost of testing. Aim: Although many machine learning algorithms have been used to classify software modules based on static code metrics, the k-Nearest Neighbors (kNN) method does not greatly improve defect prediction because it requires careful set-up of multiple configuration parameters before it can be used. To address this issue, we used the Non-dominated Sorting Genetic Algorithm (NSGA-II) to optimize the parameters of the kNN classifier with the aim of improving SDP accuracy. We used NSGA-II because the existing accuracy metrics often behave differently and can reach opposite judgments when evaluating SDP models. This means that changing one parameter might improve one accuracy measure while worsening the others. Method: The proposed NSGAII-kNN model was evaluated against the classical kNN model and state-of-the-art machine learning algorithms such as Support Vector Machine (SVM), Naïve Bayes (NB), and Random Forest (RF) classifiers. Results: Results indicate that the GA-optimized kNN model yields a higher Matthews Correlation Coefficient and higher balanced accuracy based on ten datasets. Conclusion: The paper concludes that integrating GA with kNN improved defect prediction when applied to small or large datasets.
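The conflict between accuracy measures mentioned in the abstract can be illustrated with a minimal sketch. It computes the Matthews Correlation Coefficient and balanced accuracy from confusion-matrix counts and keeps only the non-dominated configurations, which is the core idea behind non-dominated sorting. The configuration names and confusion-matrix counts are made-up assumptions for illustration, not results from the paper, and the code is not the authors' implementation of NSGA-II.

```python
from math import sqrt


def mcc(tp, fp, tn, fn):
    """Matthews Correlation Coefficient from confusion-matrix counts."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def balanced_accuracy(tp, fp, tn, fn):
    """Mean of the recall on defective modules and the recall on clean modules."""
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    tnr = tn / (tn + fp) if (tn + fp) else 0.0
    return (tpr + tnr) / 2


def dominates(a, b):
    """True if score tuple a is at least as good as b everywhere and better somewhere."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(candidates):
    """Keep the candidates whose scores are not dominated by any other candidate."""
    return [
        c for c in candidates
        if not any(dominates(o["scores"], c["scores"]) for o in candidates)
    ]


# Hypothetical kNN configurations with invented (tp, fp, tn, fn) counts.
configs = {
    "k=1, euclidean": (30, 20, 180, 20),
    "k=5, manhattan": (45, 70, 130, 5),
    "k=9, euclidean": (25, 45, 155, 25),
}

candidates = [
    {"name": name, "scores": (mcc(*cm), balanced_accuracy(*cm))}
    for name, cm in configs.items()
]
front = pareto_front(candidates)

for c in front:
    print(c["name"], [round(s, 3) for s in c["scores"]])
```

In this invented example the first configuration has the higher MCC while the second has the higher balanced accuracy, so neither dominates the other and both stay on the front; the third is worse on both measures and is discarded. A multi-objective optimizer such as NSGA-II searches for such trade-off sets rather than a single best score.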


Journal: e-Informatica Software Engineering Journal
Publication Date: Jan 1, 2024
License type: cc-by

Similar Papers

Which type of metrics are useful to deal with class imbalance in software defect prediction?
Muhammed Maruf Öztürk
Information and Software Technology | VOL. 92
08 Jul 2017

Interpretable Software Defect Prediction from Project Effort and Static Code Metrics
Susmita Haldar ... Luiz Fernando Capretz
Computers | VOL. 13
16 Feb 2024

Label propagation based semi-supervised learning for software defect prediction
Zhi-Wu Zhang ... Xiao-Yuan Jing
Automated Software Engineering | VOL. 24
22 Mar 2016

Software Defect Prediction Based on GUHA Data Mining Procedure and Multi-Objective Pareto Efficient Rule Selection
Bharavi Mishra ... K.K Shukla
International Journal of Software Science and Computational Intelligence | VOL. 6
01 Apr 2014
