A STUDY OF SOFTWARE METRIC SELECTION TECHNIQUES: STABILITY ANALYSIS AND DEFECT PREDICTION MODEL PERFORMANCE

Huanjing Wang,Qianhui (Althea) Liang,Taghi M Khoshgoftaar

doi:10.1142/s0218213013600105

Abstract

Software metrics (features or attributes) are collected during the software development cycle. Metric selection is one of the most important preprocessing steps in the process of building defect prediction models and may improve the final prediction result. However, the addition or removal of program modules (instances or samples) can alter the subsets chosen by a feature selection technique, rendering the previously-selected feature sets invalid. Very limited research have been done considering both stability (or robustness) and defect prediction model performance together in the software engineering domain, despite the importance of both aspects when choosing a feature selection technique. In this paper, we test the stability and classification model performance of eighteen feature selection techniques as the magnitude of change to the datasets and the size of the selected feature subsets are varied. All experiments were conducted on sixteen datasets from three real-world software projects. The experimental results demonstrate that Gain Ratio shows the least stability while two different versions of ReliefF show the most stability, followed by the PRC- and AUC-based threshold-based feature selection techniques. Results also show that the signal-to-noise ranker performed moderately in terms of robustness and was the best ranker in terms of model performance. Finally, we conclude that while for some rankers, stability and classification performance are correlated, this is not true for other rankers, and therefore performance according to one scheme (stability or model performance) cannot be used to predict performance according to the other.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A STUDY OF SOFTWARE METRIC SELECTION TECHNIQUES: STABILITY ANALYSIS AND DEFECT PREDICTION MODEL PERFORMANCE

Abstract

Talk to us

Similar Papers

More From: International Journal on Artificial Intelligence Tools

Lead the way for us

Journal: International Journal on Artificial Intelligence Tools	Publication Date: Oct 1, 2013
Citations: 13

Similar Papers

An Empirical Investigation on Wrapper-Based Feature Selection for Predicting Software Quality
Huanjing Wang ... Taghi M Khoshgoftaar
International Journal of Software Engineering and Knowledge Engineering | VOL. 25
Huanjing Wang, et. al.Huanjing Wang ... Taghi M Khoshgoftaar
01 Feb 2015
International Journal of Software Engineering and Knowledge Engineering | VOL. 25

Stability of filter- and wrapper-based software metric selection techniques
Huanjing Wang ... Amri Napolitano
-
Huanjing Wang, et. al.Huanjing Wang ... Amri Napolitano
01 Aug 2014
01 Aug 2014

Stability and Classification Performance of Feature Selection Techniques
Huanjing Wang ... Qianhui Liang
-
Huanjing Wang, et. al. Huanjing Wang ... Qianhui Liang
01 Dec 2011
01 Dec 2011

The impact of feature reduction techniques on defect prediction models
Masanari Kondo ... Cor-Paul Bezemer
Empirical Software Engineering | VOL. 24
Masanari Kondo, et. al.Masanari Kondo ... Cor-Paul Bezemer
22 Jan 2019
Empirical Software Engineering | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A STUDY OF SOFTWARE METRIC SELECTION TECHNIQUES: STABILITY ANALYSIS AND DEFECT PREDICTION MODEL PERFORMANCE

Abstract

Talk to us

Similar Papers

More From: International Journal on Artificial Intelligence Tools