Abstract

Quality assurance activities such as testing, verification and validation, fault tolerance and fault prediction are among the central concerns of software engineering. When a company lacks the budget and time to test an entire application, a project manager can use fault prediction algorithms to identify the parts of the system that are more defect-prone. Software engineering offers many prediction approaches, such as test-effort, security and cost prediction. Since most of them do not have a stable model, this paper studies software fault prediction using different machine learning techniques, including decision trees, decision tables, random forest, neural networks and Naïve Bayes, as well as distinctive classifiers from artificial immune systems (AIS) such as the artificial immune recognition system (AIRS), CLONALG and Immunos. We use four public NASA datasets for our experiments; these datasets differ in size and in the number of defective instances. Distinct parameters, namely method-level metrics and two feature selection approaches, principal component analysis (PCA) and correlation-based feature selection (CFS), are used to determine which configuration performs best. According to this study, random forest provides the best prediction performance for large datasets, and Naïve Bayes is a reliable algorithm for small datasets even when one of the feature selection techniques is applied. Among the AIS classifiers, Immunos99 performs well when a feature selection technique is applied, and AIRSParallel performs better without any feature selection. Performance is evaluated with three metrics: the area under the receiver operating characteristic curve (AUC), the probability of detection (PD) and the probability of false alarm (PF). Together, these three metrics give reliable prediction criteria.
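The three evaluation metrics named in the abstract can all be derived from a classifier's predictions on a labeled test set. A minimal sketch in plain Python (the counts and score values below are illustrative, not results from the paper):

```python
# PD (probability of detection, i.e. recall) and PF (probability of
# false alarm) from binary confusion-matrix counts, plus AUC computed
# directly from its ranking interpretation: the probability that a
# randomly chosen defective module receives a higher score than a
# randomly chosen non-defective one.

def pd_pf(tp, fn, fp, tn):
    pd = tp / (tp + fn)   # fraction of defective modules correctly flagged
    pf = fp / (fp + tn)   # fraction of clean modules wrongly flagged
    return pd, pf

def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Illustrative values only:
print(pd_pf(tp=30, fn=10, fp=5, tn=55))          # (0.75, 0.0833...)
print(auc([1, 1, 0, 0], [0.9, 0.4, 0.6, 0.2]))   # 0.75
```

A high PD alone is not informative, since a classifier that flags every module reaches PD = 1; this is why the paper pairs PD with PF and AUC.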

Highlights

  • As today’s software grows rapidly in size and complexity, the prediction of software reliability plays a crucial role in the software development process [1]

  • We identified fault prediction algorithms based on different machine learning classifiers and distinct feature selection techniques

  • This study shows that applying different feature selection techniques has little effect on the results; they mainly reduce the execution time



Introduction

As today’s software grows rapidly in size and complexity, the prediction of software reliability plays a crucial role in the software development process [1]. The research questions are as follows:

  • RQ1: Which machine learning algorithm performs best on small and large datasets when 21 method-level metrics are used?

  • RQ2: Which AIS algorithm performs best on small and large datasets when 21 method-level metrics are used?

  • RQ3: Which machine learning algorithm performs best on small and large datasets when 37 method-level metrics are used?

  • RQ4: Which machine learning algorithm performs best on small and large datasets when PCA and CFS are applied to the 21 method-level metrics?

  • RQ5: Which AIS algorithm performs best on small and large datasets when PCA and CFS are applied to the 21 method-level metrics?

In experiment 4, to answer the last question, we doubled the defect rate of the CM1 dataset to see whether this affects the performance of the prediction models.
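The experimental setup behind RQ1 and RQ4, training the studied classifiers on method-level metrics with and without feature selection, can be sketched with scikit-learn. This is an illustration under stated assumptions, not the paper's code: synthetic, imbalanced data stands in for the NASA 21-metric datasets, PCA is used for feature selection (CFS has no standard scikit-learn implementation), and only two of the studied classifiers are shown.

```python
# Sketch: comparing random forest and Naive Bayes with and without
# PCA feature selection, scored by cross-validated AUC.
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline

# Synthetic stand-in: 500 modules, 21 metrics, ~10% defective.
X, y = make_classification(n_samples=500, n_features=21,
                           weights=[0.9, 0.1], random_state=0)

models = {"RF": RandomForestClassifier(random_state=0),
          "NB": GaussianNB()}

results = {}
for name, clf in models.items():
    variants = [("raw", clf),
                ("PCA", make_pipeline(PCA(n_components=5), clf))]
    for label, est in variants:
        score = cross_val_score(est, X, y, cv=5, scoring="roc_auc").mean()
        results[f"{name}-{label}"] = score
        print(f"{name} ({label}): AUC = {score:.3f}")
```

The same loop structure extends to the other classifiers studied (decision trees, decision tables, neural networks and the AIS family), which is essentially what the four experiments vary.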

Related works
Artificial immune system
CLONALG
Immunos81
Feature selection
Principal component analysis
Correlation-based feature selection
Dataset selection
Variable selection
Performance measurements criteria
Experiment 1
Experiment 2
Experiment 3
Experiment 4
Findings
Summary and conclusion