Abstract
Software projects now routinely collect measurement data throughout the development process. With these data, defect prediction tools can estimate the defect proneness of a software module, assisting and guiding software practitioners. With timely and accurate defect predictions, practitioners can focus their limited testing resources on higher-risk areas. This paper reports a benchmarking study in which a genetic algorithm automatically generates and compares different learning schemes (preprocessing + attribute selection + learning algorithm). The performance of the defect prediction models, measured as the Area Under the Curve (AUC), was validated on NASA-MDP and PROMISE data sets. Twelve data sets, eight from NASA-MDP and four from PROMISE projects, were analyzed using \(M\times N\)-fold cross-validation. The genetic algorithm selected the components of the learning schemes automatically, then evaluated and reported those with the best performance. In all, 864 learning schemes were studied. The best-performing learning schemes most often combined the Log and BoxCox data preprocessors; the Backward Elimination, BestFirst, and LinearForwardSelection attribute selectors; and the NaiveBayes, NaiveBayesSimple, SimpleLogistic, MultilayerPerceptron, Logistic, LogitBoost, BayesNet, and OneR learning algorithms. According to the statistical analysis, the genetic algorithm showed stable performance and runtime across data sets.
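To make the learning-scheme idea concrete, the following is a minimal sketch of how a scheme (preprocessor + attribute selector + learner) could be encoded as a chromosome and scored by \(M\times N\)-fold cross-validated AUC. It uses scikit-learn stand-ins for the named components (the study's actual components appear to be WEKA implementations), and the toy evolutionary loop, the component pools, and all parameter values here are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch only: scikit-learn analogues of the scheme components,
# not the study's actual (WEKA-based) setup.
import random
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RepeatedStratifiedKFold, cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer, PowerTransformer

# Candidate components for each gene of the learning-scheme chromosome.
PREPROCESSORS = {
    "log":    FunctionTransformer(np.log1p),           # stands in for the Log preprocessor
    "boxcox": PowerTransformer(method="yeo-johnson"),  # Box-Cox-style power transform
}
SELECTORS = {
    "kbest3": SelectKBest(f_classif, k=3),  # simple stand-ins for the attribute selectors
    "kbest5": SelectKBest(f_classif, k=5),
}
LEARNERS = {
    "naivebayes": GaussianNB(),
    "logistic":   LogisticRegression(max_iter=1000),
}

def evaluate(scheme, X, y, m=3, n=5):
    """Fitness: mean AUC of a scheme under M x N-fold cross-validation."""
    pre, sel, clf = scheme
    pipe = Pipeline([("pre", PREPROCESSORS[pre]),
                     ("sel", SELECTORS[sel]),
                     ("clf", LEARNERS[clf])])
    cv = RepeatedStratifiedKFold(n_splits=n, n_repeats=m, random_state=0)
    return cross_val_score(pipe, X, y, cv=cv, scoring="roc_auc").mean()

def random_scheme(rng):
    return (rng.choice(list(PREPROCESSORS)),
            rng.choice(list(SELECTORS)),
            rng.choice(list(LEARNERS)))

def mutate(scheme, rng):
    """Replace one randomly chosen gene with another candidate component."""
    pools = [list(PREPROCESSORS), list(SELECTORS), list(LEARNERS)]
    gene = rng.randrange(3)
    s = list(scheme)
    s[gene] = rng.choice(pools[gene])
    return tuple(s)

if __name__ == "__main__":
    rng = random.Random(42)
    X, y = make_classification(n_samples=300, n_features=10,
                               weights=[0.8], random_state=0)  # imbalanced, like defect data
    X = np.abs(X)  # software metrics are nonnegative; keep toy data in Log's domain
    # Tiny generational loop: keep the best half, mutate it to refill the population.
    pop = [random_scheme(rng) for _ in range(6)]
    for _ in range(3):
        ranked = sorted(pop, key=lambda s: evaluate(s, X, y), reverse=True)
        elite = ranked[: len(pop) // 2]
        pop = elite + [mutate(rng.choice(elite), rng) for _ in elite]
    best = max(pop, key=lambda s: evaluate(s, X, y))
    print("best scheme:", best)
```

Encoding each scheme as a three-gene tuple keeps the search space identical to the Cartesian product of components that a full benchmark would enumerate, while the GA only evaluates a fraction of it; the repeated stratified folds mirror the \(M\times N\) cross-validation used to stabilize the AUC estimates.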