Evolutionary Optimization of Software Quality Modeling with Multiple Repositories

Yi Liu,Taghi M Khoshgoftaar,Naeem Seliya

doi:10.1109/tse.2010.51

Abstract

A novel search-based approach to software quality modeling with multiple software project repositories is presented. Training a software quality model with only one software measurement and defect data set may not effectively encapsulate quality trends of the development organization. The inclusion of additional software projects during the training process can provide a cross-project perspective on software quality modeling and prediction. The genetic-programming-based approach includes three strategies for modeling with multiple software projects: Baseline Classifier, Validation Classifier, and Validation-and-Voting Classifier. The latter is shown to provide better generalization and more robust software quality models. This is based on a case study of software metrics and defect data from seven real-world systems. A second case study considers 17 different (nonevolutionary) machine learners for modeling with multiple software data sets. Both case studies use a similar majority-voting approach for predicting fault-proneness class of program modules. It is shown that the total cost of misclassification of the search-based software quality models is consistently lower than those of the non-search-based models. This study provides clear guidance to practitioners interested in exploiting their organization's software measurement data repositories for improved software quality modeling.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evolutionary Optimization of Software Quality Modeling with Multiple Repositories

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Software Engineering

Lead the way for us

Journal: IEEE Transactions on Software Engineering	Publication Date: Nov 1, 2010
Citations: 168

Similar Papers

Software quality estimation with limited fault data: a semi-supervised learning perspective
Naeem Seliya ... Taghi M Khoshgoftaar
Software Quality Journal | VOL. 15
Naeem Seliya, et. al.Naeem Seliya ... Taghi M Khoshgoftaar
10 Aug 2007
Software Quality Journal | VOL. 15

Software Quality Modeling as a Reliability Tool
Naeem Seliya ... Taghi M Khoshgoftaar
-
Naeem Seliya, et. al.Naeem Seliya ... Taghi M Khoshgoftaar
15 Sep 2008
15 Sep 2008

Empirical Analysis of Quality Models in Practice in Small IT Companies in SEE Region
Florinda Imeri ... Mentor Hamiti
Procedia - Social and Behavioral Sciences | VOL. 191
Florinda Imeri, et. al.Florinda Imeri ... Mentor Hamiti
01 Jun 2015
Procedia - Social and Behavioral Sciences | VOL. 191

Software Quality Modeling with Limited Apriori Defect Data
Naeem Seliya ... Taghi M Khoshgoftaar
-
Naeem Seliya, et. al.Naeem Seliya ... Taghi M Khoshgoftaar
01 Jan 2007
01 Jan 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evolutionary Optimization of Software Quality Modeling with Multiple Repositories

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Software Engineering