Abstract

Assessing the quality of the software is both important and difficult. For this purpose, software fault prediction (SFP) models have been extensively used. However, selecting the right model and declaring the best out of multiple models are dependent on the performance measures. We analyze 14 frequently used, non-graphic classifier’s performance measures used in SFP studies. These analyses would help machine learning practitioners and researchers in SFP to select the most appropriate performance measure for the models’ evaluation. We analyze the performance measures for resilience against producing invalid values through our proposed plausibility criterion. After that, consistency and discriminancy analyses are performed to find the best out of the 14 performance measures. Finally, we draw the order of the selected performance measures from better to worse in both balance and imbalance datasets. Our analyses conclude that the F-measure and the G-mean1 are equally the best candidates to evaluate the SFP models with careful analysis of the result, as there is a risk of invalid values in certain scenarios.

Highlights

  • Software fault prediction (SFP) is a means to detect faultprone components or a number of expected faults in a software component

  • Our work is significant in the following ways; 1) We discuss as many as 14 performance measures being used in SFP domain

  • Bradle compares Area under the Curve (AUC) with Accuracy [10]. He computes the sensitivity of both measures in Analysis of Variance (ANOVA) and Duncan’s multiple range test and recommends AUC to be used in SFP studies

Read more

Summary

INTRODUCTION

Software fault prediction (SFP) is a means to detect faultprone components or a number of expected faults in a software component. SFP helps in reducing testing cost and improving the quality of the system This can direct the testing team to focus more on the fault-prone modules. There is a sporadic address on the recommendation of the performance measure For these reasons, the objectives of this paper include: 1) A survey of commonly used performance measures in SFP. We list out 14 performance measures which are commonly used in SFP studies. Our work is significant in the following ways; 1) We discuss as many as 14 performance measures being used in SFP domain.

RELATED WORK
PRECISION
G-MEAN1 AND G-MEAN2
J COEFFICIENT
POSITIVE AND NEGATIVE ORIENTED MEASURES
VIII. CONCLUSION
FUTURE GUIDELINES
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call