Prediction of Cross Project Defects using Ensemble based Multinomial Classifier

Lipika Goel, Mayank Sharma, D Damodaran, Sunil Khatri

doi:10.4108/eai.13-7-2018.159974


Abstract

BACKGROUND: The availability of defect-related data from different projects has made cross-project defect prediction an open issue. Many studies have focused on analyzing and improving the performance of cross-project defect prediction. OBJECTIVE: Multinomial classification has not been explored much in this setting. This paper focuses on multiclass/multinomial classification for cross-project defect prediction. METHOD: Two ensemble-based statistical models, Gradient Boosting and Random Forest, are used for classification. An empirical study is carried out to determine the performance of multinomial classification for cross-project defect prediction. Depending on its number of defects, each class-level record is assigned to one of three defined classes: class 0, class 1, or class 2. RESULTS & CONCLUSION: The major outcome of the paper is that multinomial/multiclass classification is applicable to cross-project data and gives results comparable to those obtained on within-project defect data.
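The three-class labelling described in the abstract can be sketched as a simple binning of defect counts. This is a minimal illustration only: the abstract does not state where the class boundaries lie, so the thresholds `low_max` and `mid_max` and the function name `to_defect_class` are hypothetical.

```python
from collections import Counter


def to_defect_class(defect_count, low_max=0, mid_max=2):
    """Map a raw defect count to class 0, 1, or 2.

    The thresholds are assumptions for illustration; the source only
    says that three classes are derived from the number of defects.
    """
    if defect_count < 0:
        raise ValueError("defect count cannot be negative")
    if defect_count <= low_max:
        return 0
    if defect_count <= mid_max:
        return 1
    return 2


# Toy defect counts for eight software classes (made-up values).
defect_counts = [0, 0, 1, 2, 3, 7, 0, 1]
labels = [to_defect_class(count) for count in defect_counts]

# Inspect how balanced the resulting three-class target is.
class_distribution = Counter(labels)
print(labels)
print(dict(class_distribution))
```

Checking the class distribution after binning is worthwhile because defect data is usually skewed toward defect-free classes, which affects how multiclass metrics should be read.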

Highlights

Identifying defect-prone classes before actual testing reduces testing cost and effort.
End for
# Defining the training and the testing data for Cross Project Defect Prediction
for project belongs to CP_test
    CP_train = CP_data - data
    CP_test = data
# Defining the training and the testing data for Within Project Defect Prediction
for project belongs to WP_test
    splitting training and testing data with 60:40 ratio
    train = {selecting 60% of data us }
    test = training_data - train
# Modeling: applying machine learning algorithms
model1 = random_forest_model_training(train)
model2 = gradient_boosting_model_training(train)
Applying Grid Search technique to tune the hyper-parameters for both models
Predicting the results on test data
Conversion of 10-class to 3-class and re-train the models again with above steps
Evaluating the models using metrics {auc, precision, recall, f1score}
In this paper we analyzed the performance of multinomial classification for cross-project and within-project defect prediction.
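The two data-splitting schemes named in the highlights can be sketched in plain Python. This is one reading of the pseudocode, not the authors' implementation: cross-project prediction is taken to mean holding out one project as the test set and training on all the others, and within-project prediction is taken to mean a shuffled 60:40 split of a single project. The function names, the toy `projects` dictionary and the fixed random seed are all hypothetical.

```python
import random


def cross_project_splits(projects):
    """Yield (target, train_rows, test_rows), holding out one project at a time."""
    for target, test_rows in projects.items():
        # Training data is everything except the held-out project.
        train_rows = [
            row
            for name, rows in projects.items()
            if name != target
            for row in rows
        ]
        yield target, train_rows, list(test_rows)


def within_project_split(rows, train_fraction=0.6, seed=0):
    """Shuffle one project's rows and split them 60:40 by default."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(len(shuffled) * train_fraction))
    return shuffled[:cut], shuffled[cut:]


# Toy data (made up): project name -> rows of (metric_vector, defect_class).
projects = {
    "proj_a": [((10, 2), 0), ((40, 9), 2), ((15, 3), 0), ((22, 5), 1), ((30, 6), 1)],
    "proj_b": [((11, 1), 0), ((41, 8), 2), ((16, 2), 0), ((23, 4), 1), ((31, 7), 2)],
    "proj_c": [((12, 3), 0), ((42, 7), 2), ((17, 4), 1), ((24, 6), 1), ((32, 5), 0)],
}

cp_splits = list(cross_project_splits(projects))
wp_train, wp_test = within_project_split(projects["proj_a"])

for target, train_rows, test_rows in cp_splits:
    print(target, len(train_rows), len(test_rows))
print(len(wp_train), len(wp_test))
```

Each training set produced this way could then be passed to an ensemble classifier, for example scikit-learn's `RandomForestClassifier` or `GradientBoostingClassifier` tuned with `GridSearchCV`, although the source does not say which library was used.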

Summary

OBJECTIVE

Multinomial classification has not been explored much in this setting. This paper focuses on multiclass/multinomial classification for cross-project defect prediction. METHOD: Two ensemble-based statistical models, Gradient Boosting and Random Forest, are used for classification. An empirical study is carried out to determine the performance of multinomial classification for cross-project defect prediction. Depending on its number of defects, each class-level record is assigned to one of three defined classes: class 0, class 1, or class 2. RESULTS & CONCLUSION: The major outcome of the paper is that multinomial/multiclass classification is applicable to cross-project data and gives results comparable to those obtained on within-project defect data. Received on 10 May 2019, accepted on 27 August 2019, published on 09 September 2019.
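Comparing cross-project and within-project results for a three-class target requires multiclass versions of precision, recall and F1. The sketch below computes them per class and averages them with equal weight (macro-averaging). Macro-averaging is an assumption here, since the source does not state which averaging scheme was used, and the function name `macro_scores` and the toy labels are hypothetical.

```python
def macro_scores(y_true, y_pred, classes=(0, 1, 2)):
    """Return macro-averaged precision, recall and F1 over the given classes."""
    precisions, recalls, f1s = [], [], []
    pairs = list(zip(y_true, y_pred))
    for c in classes:
        # One-vs-rest counts for class c.
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)
    n = len(classes)
    return {
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Toy labels (made up): one class-1 instance is misclassified as class 2.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 1, 2, 2, 2]
scores = macro_scores(y_true, y_pred)
print(scores)
```

Running the same function on the cross-project and within-project predictions gives directly comparable numbers, which is the kind of comparison the conclusion refers to.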

Introduction

State Of Art

Metrics used and description of Datasets

Ensemble Learning Models

Evaluation Measures

Data Preprocessing and Preparation

Model Fitting

Model Evaluation

Performance Results

Results & Discussion

Threats To Validity

Conclusion & Future Scope

Journal: ICST Transactions on Scalable Information Systems
Publication Date: Jul 13, 2018
Citations: 1
License type: cc-by
