A Systematic Review of Feature Selection Techniques in Software Quality Prediction

Hadeel Alsolai, Marc Roper

doi:10.1109/icecta48151.2019.8959566

Abstract

Background: Feature selection techniques are important for improving machine learning models because they can increase prediction accuracy and reduce the time needed to build a model. Recently, feature selection techniques have been applied to software quality prediction problems with differing results and no clear indication of which techniques are used most frequently.

Objective: This study conducts a systematic review of the application of feature selection techniques in software quality prediction and answers eight research questions.

Method: The review evaluates 15 papers, published in 9 journals and 6 conference proceedings from 2007 to 2017, using the standard systematic literature review method.

Results: The filter feature selection method was the most commonly used in the studies (60%), with RELIEF the most frequently employed filter technique, and only a limited number of studies employed an ensemble method. Several studies (60%) used public datasets available in the PROMISE software project repository. Most studies focused on software defect prediction (a classification problem) using area under the curve (AUC) as the primary evaluation measure, whereas only two studies focused on software maintainability prediction (a regression problem) using mean magnitude of relative error (MMRE) as the primary evaluation measure. All selected studies performed k-fold cross-validation to evaluate model accuracy. Individual prediction models were mostly employed, and ensemble models appeared in only three studies. Naive Bayes was the most investigated individual model, whereas random forest was the most investigated ensemble model.

Conclusion: Feature selection techniques used by the selected primary studies have a positive impact on the performance of the prediction models. Further, both ensemble feature selection methods and ensemble models can increase prediction accuracy over single methods or individual models, and the studies report such improvements; however, the application of these techniques in software quality prediction is still limited.
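For readers unfamiliar with the two evaluation measures named in the Results, the following is a minimal sketch of how they are commonly computed. It is an illustration only, not code from the reviewed studies. The function names `mmre` and `auc` are hypothetical, MMRE is assumed to follow its usual definition (the mean of |actual - predicted| / |actual|), and AUC is computed in its pairwise-ranking form, where ties count as half.

```python
from typing import Sequence


def mmre(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean magnitude of relative error: mean of |actual - predicted| / |actual|.

    Assumes every actual value is non-zero (e.g. a maintainability effort measure).
    """
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted))
    return total / len(actual)


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise ranking.

    Returns the fraction of (defective, non-defective) pairs in which the
    defective module receives the higher score; tied pairs count as 0.5.
    Labels are assumed to be 1 (defective) or 0 (non-defective).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Made-up illustrative values, not data from any reviewed study.
    print(mmre([100.0, 200.0], [90.0, 220.0]))
    print(auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))
```

Lower MMRE indicates better regression predictions, whereas higher AUC indicates better classification, so the two measures are not directly comparable across the defect and maintainability studies.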

