Performance Analysis of Feature Selection Methods in Software Defect Prediction: A Search Method Approach

Abdullateef Oluwagbemiga Balogun,Shuib Basri,Said Jadid Abdulkadir,Ahmad Sobri Hashim

doi:10.3390/app9132764

Abdullateef Oluwagbemiga Balogun, Shuib Basri + Show 2 more

Open Access

https://doi.org/10.3390/app9132764

Copy DOI

Abstract

Software Defect Prediction (SDP) models are built using software metrics derived from software systems. The quality of SDP models depends largely on the quality of software metrics (dataset) used to build the SDP models. High dimensionality is one of the data quality problems that affect the performance of SDP models. Feature selection (FS) is a proven method for addressing the dimensionality problem. However, the choice of FS method for SDP is still a problem, as most of the empirical studies on FS methods for SDP produce contradictory and inconsistent quality outcomes. Those FS methods behave differently due to different underlining computational characteristics. This could be due to the choices of search methods used in FS because the impact of FS depends on the choice of search method. It is hence imperative to comparatively analyze the FS methods performance based on different search methods in SDP. In this paper, four filter feature ranking (FFR) and fourteen filter feature subset selection (FSS) methods were evaluated using four different classifiers over five software defect datasets obtained from the National Aeronautics and Space Administration (NASA) repository. The experimental analysis showed that the application of FS improves the predictive performance of classifiers and the performance of FS methods can vary across datasets and classifiers. In the FFR methods, Information Gain demonstrated the greatest improvements in the performance of the prediction models. In FSS methods, Consistency Feature Subset Selection based on Best First Search had the best influence on the prediction models. However, prediction models based on FFR proved to be more stable than those based on FSS methods. Hence, we conclude that FS methods improve the performance of SDP models, and that there is no single best FS method, as their performance varied according to datasets and the choice of the prediction model. However, we recommend the use of FFR methods as the prediction models based on FFR are more stable in terms of predictive performance.

Highlights

Software Defect Prediction (SDP) models are built using software metrics which based on data collected from the previous developed system or similar software projects [1]
The performances of each prediction models were analyzed based on accuracy and the results were compared on two cases
The performance of SDP depends on the quality of software defect datasets which suffers from high dimensionality

Summary

Introduction

Software Defect Prediction (SDP) models are built using software metrics which based on data collected from the previous developed system or similar software projects [1]. Using such a model, the defect-proneness of the software modules under development can be predicted. The goal of SDP is to achieve high software quality and reliability with the effective use of available limited resources. SDP involves identifying software modules or components that are prone to defects This will avail software engineers to prioritize the utilization of inhibited resources during each phase of the software development [2,3].

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jul 9, 2019
Citations: 79	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Performance Analysis of Feature Selection Methods in Software Defect Prediction: A Search Method Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Impact of Feature Selection Methods on the Predictive Performance of Software Defect Prediction Models: An Extensive Empirical Study
Abdullateef O Balogun ... Malek A Almomani
Symmetry | VOL. 12
Abdullateef O Balogun, et. al.Abdullateef O Balogun ... Malek A Almomani
09 Jul 2020
Symmetry | VOL. 12

ELM and KELM based software defect prediction using feature selection techniques
Ishani Arora ... Anju Saha
Journal of Information and Optimization Sciences | VOL. 40
Ishani Arora, et. al.Ishani Arora ... Anju Saha
04 Jul 2019
Journal of Information and Optimization Sciences | VOL. 40

Research on software defect prediction technology based on deep learning
Pengcheng Jiang
-
Pengcheng JiangPengcheng Jiang
01 Jan 2020
01 Jan 2020

Is Open-Source Software Valuable for Software Defect Prediction of Proprietary Software and Vice Versa?
Misha Kakkar ... P S Grover
-
Misha Kakkar, et. al.Misha Kakkar ... P S Grover
25 Nov 2017
25 Nov 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Analysis of Feature Selection Methods in Software Defect Prediction: A Search Method Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences