Tsbagging: A Novel Cross-Project Software Defect Prediction Algorithm Based on Semisupervised Clustering

Shiqi Tang,Haijin Ji,Yongming Yao,Kaishun Wu,Erhu Liu,Song Huang

doi:10.1155/2022/6339684

Abstract

Software defect prediction (SDP) is an important technology which is widely applied to improve software quality and reduce development costs. It is difficult to train the SDP model when software to be test only has limited historical data. Cross-project defect prediction (CPDP) has been proposed to solve this problem by using source project data to train the defect prediction model. Most of CPDP methods build defect prediction models based on the similarity of feature space or data distance between different projects. However, when the target project has a small amount of label data, these methods usually do not consider this part of data information. Therefore, when the distribution between source project and target project is quite different, these methods are difficult to achieve good prediction performance. To solve this problem, this paper proposes a CPDP method based on a semisupervised clustering (namely, Tsbagging). Tsbagging has two stages; in the first stage, we cluster to the source project data based on the limited labeled data in the target project and assign different weights to these source project data according to the clustering results. In the second stage, we use bagging method to train the prediction model based on the weight assigned in the first stage. The experimental results show that the performance achieved by Tsbagging is better than other existing SDP methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tsbagging: A Novel Cross-Project Software Defect Prediction Algorithm Based on Semisupervised Clustering

Abstract

Talk to us

Similar Papers

More From: Scientific Programming

Lead the way for us

Journal: Scientific Programming	Publication Date: Sep 28, 2022
License type: cc-by

Similar Papers

Deep Semantic Feature Learning for Software Defect Prediction
Song Wang ... Jaechang Nam
IEEE Transactions on Software Engineering | VOL. 46
Song Wang, et. al.Song Wang ... Jaechang Nam
01 Dec 2020
IEEE Transactions on Software Engineering | VOL. 46

Simplify Your Neural Networks: An Empirical Study on Cross-Project Defect Prediction
Ruchika Malhotra ... Abuzar Ahmed Khan
-
Ruchika Malhotra, et. al.Ruchika Malhotra ... Abuzar Ahmed Khan
14 Sep 2021
14 Sep 2021

Software defect prediction via transfer learning based neural network
Qimeng Cao ... Qing Sun
-
Qimeng Cao, et. al.Qimeng Cao ... Qing Sun
01 Oct 2015
01 Oct 2015

Software defect prediction with semantic and structural information of codes based on Graph Neural Networks
Chunying Zhou ... Peng He
Information and Software Technology | VOL. 152
Chunying Zhou, et. al.Chunying Zhou ... Peng He
01 Dec 2022
Information and Software Technology | VOL. 152

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tsbagging: A Novel Cross-Project Software Defect Prediction Algorithm Based on Semisupervised Clustering

Abstract

Talk to us

Similar Papers

More From: Scientific Programming