Comparative analysis of statistical and machine learning methods for predicting faulty modules

Ruchika Malhotra

doi:10.1016/j.asoc.2014.03.032

Abstract

Demand for high-quality software has grown rapidly in the last few years. This has led to increased use of machine learning methods for analyzing and assessing public domain data sets. These methods can be used to develop models for estimating software quality attributes such as fault proneness, maintenance effort, and testing effort. Software fault prediction in the early phases of software development can guide software practitioners to focus the available testing resources on the weaker areas of the software. This paper analyses and compares a statistical method (logistic regression) and six machine learning methods for fault prediction. The machine learning methods (Decision Tree, Artificial Neural Network, Cascade Correlation Network, Support Vector Machine, Group Method of Data Handling, and Gene Expression Programming) are empirically validated to find the relationship between static code metrics and the fault proneness of a module. To assess and compare the models predicted using regression and the machine learning methods, we used two publicly available data sets, AR1 and AR6. We compared the predictive capability of the models using the Area Under the Curve (AUC), measured from Receiver Operating Characteristic (ROC) analysis. The study confirms the predictive capability of the machine learning methods for software fault prediction. The results show that the AUC of the model predicted using the Decision Tree method is 0.8 and 0.9 (for the AR1 and AR6 data sets, respectively), making it a better model than those predicted using logistic regression and the other machine learning methods.
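The abstract ranks models by the Area Under the ROC Curve. As a minimal sketch of what that measure computes, the following Python function calculates it from its rank interpretation: the probability that a randomly chosen faulty module receives a higher predicted score than a randomly chosen non-faulty one, with ties counted as half. The function name `roc_auc` and the toy labels and scores are illustrative assumptions; they are not taken from the paper or from the AR1 and AR6 data sets.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: 1 for a faulty module, 0 for a non-faulty module.
    scores: predicted fault-proneness, higher means more likely faulty.
    """
    faulty = [s for y, s in zip(labels, scores) if y == 1]
    clean = [s for y, s in zip(labels, scores) if y == 0]
    if not faulty or not clean:
        raise ValueError("need at least one faulty and one non-faulty module")

    wins = 0.0
    for f in faulty:
        for c in clean:
            if f > c:
                wins += 1.0   # faulty module correctly ranked above clean one
            elif f == c:
                wins += 0.5   # ties count as half
    return wins / (len(faulty) * len(clean))


# Hypothetical toy data (not from the study): 2 faulty and 3 non-faulty modules.
toy_labels = [1, 1, 0, 0, 0]
toy_scores = [0.9, 0.4, 0.5, 0.2, 0.1]
print(round(roc_auc(toy_labels, toy_scores), 3))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the reported Decision Tree values of 0.8 and 0.9 should be read.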

Journal: Applied Soft Computing
Publication Date: Mar 31, 2014
Citations: 79

Similar Papers

Which type of metrics are useful to deal with class imbalance in software defect prediction?
Muhammed Maruf Öztürk
Information and Software Technology | VOL. 92
08 Jul 2017

Investigation of Machine Learning Methods for Prediction of Measured Values of Atmospheric Channel for Hybrid FSO/RF System
Maroš Lapčák ... Norbert Zdravecký
Photonics | VOL. 9
28 Jul 2022

A Brief Survey of Machine Learning Methods in Protein Sub-Golgi Localization
Wuritu Yang ... Xiao-Juan Zhu
Current Bioinformatics | VOL. 14
07 Mar 2019

Prediction of Fault-Prone Software Modules using Statistical and Machine Learning Methods
Yogesh Singh ... Ruchika Malhotra
International Journal of Computer Applications | VOL. 1
25 Feb 2010
