Ensemble MultiBoost Based on RIPPER Classifier for Prediction of Imbalanced Software Defect Data

Haitao He,Jiaxin Liu,Xiaolin Zhao,Xu Zhang,Jiadong Ren,Qian Wang,Yongqiang Cheng

doi:10.1109/access.2019.2934128

Haitao He, Jiaxin Liu + Show 5 more

Open Access

https://doi.org/10.1109/access.2019.2934128

Copy DOI

Abstract

Identifying defective software entities is essential to ensure software quality during software development. However, the high dimensionality and class distribution imbalance of software defect data seriously affect software defect prediction performance. In order to solve this problem, this paper proposes an E nsemble M ultiBoost based on R IPPER classifier for prediction of imbalanced S oftware D efect data, called EMR_SD . Firstly, the algorithm uses principal component analysis (PCA) method to find out the most effective features from the original features of the data set, so as to achieve the purpose of dimensionality reduction and redundancy removal. Furthermore, the combined sampling method of adaptive synthetic sampling (ADASYN) and random sampling without replacement is performed to solve the problem of data class imbalance. This classifier establishes association rules based on attributes and classes, using MultiBoost to reduce deviation and variance, so as to achieve the purpose of reducing classification error. The proposed prediction model is evaluated experimentally on the NASA MDP public datasets and compared with existing similar algorithms. The results show that EMR_SD algorithm is superior to DNC, CEL and other defect prediction techniques in most evaluation indicators, which proves the effectiveness of the algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 50	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Ensemble MultiBoost Based on RIPPER Classifier for Prediction of Imbalanced Software Defect Data

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ensemble MultiBoost Based on RIPPER Classifier for Prediction of Imbalanced Software Defect Data

Abstract

Talk to us

Similar Papers

More From: IEEE Access