A Three-Level Training Data Filter for Cross-project Defect Prediction

Cangzhou Yuan, Xiaowei Wang, Panpan Zhan, Xinxin Ke

doi:10.1007/978-3-030-69069-4_10

Abstract

Cross-project defect prediction aims to predict whether a module in the target project is defective, using a prediction model trained on data from other projects. Because data distributions differ between projects, cross-project defect prediction performs worse than within-project defect prediction. To reduce this difference as much as possible, researchers have proposed a variety of methods for filtering training data from the perspective of transfer learning. In this paper, we introduce a “project-instance-metric” hierarchical filtering strategy to select training data for the defect prediction model. The three-level filtering method selects, in turn, the candidate projects most similar to the target project, the instances most similar to the target instances, and the metrics most highly correlated with the prediction result. We compared three-level filtering with project-level filtering, instance-level filtering, and the combination of project-level and instance-level filtering, using four classification algorithms on NASA open source data sets. Our experiments show that the three-level filtering method achieves higher f-measure and AUC values than the single-level training data filtering methods.
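
The three levels described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how similarity or correlation is measured, so everything concrete here is an assumption made for illustration: Euclidean distance between project centroids at the project level, nearest-neighbor selection at the instance level, and absolute Pearson correlation with the defect label at the metric level. The function name `three_level_filter` and its parameters are hypothetical and are not taken from the paper.

```python
import math
from statistics import mean


def euclidean(a, b):
    """Euclidean distance between two equal-length metric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def centroid(rows):
    """Per-metric mean of a list of instances."""
    return [mean(col) for col in zip(*rows)]


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either side is constant."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx and sy else 0.0


def three_level_filter(candidates, target_X,
                       n_projects=1, n_neighbors=2, n_metrics=1):
    """candidates: {project name: (instances, labels)};
    target_X: unlabeled instances of the target project."""
    # Level 1 (assumed measure): keep the projects whose centroid
    # lies closest to the centroid of the target project.
    t_center = centroid(target_X)
    ranked = sorted(
        candidates,
        key=lambda name: euclidean(centroid(candidates[name][0]), t_center),
    )
    chosen = ranked[:n_projects]

    # Level 2 (assumed measure): for each target instance, keep its
    # nearest neighbors among the instances of the chosen projects.
    pool = [(row, label)
            for name in chosen
            for row, label in zip(*candidates[name])]
    keep = set()
    for t in target_X:
        order = sorted(range(len(pool)),
                       key=lambda i: euclidean(pool[i][0], t))
        keep.update(order[:n_neighbors])
    rows = [pool[i] for i in sorted(keep)]
    labels = [label for _, label in rows]

    # Level 3 (assumed measure): keep the metrics with the highest
    # absolute correlation to the defect label.
    n_features = len(target_X[0])
    scores = [abs(pearson([r[j] for r, _ in rows], labels))
              for j in range(n_features)]
    top = sorted(range(n_features), key=lambda j: -scores[j])[:n_metrics]
    metrics = sorted(top)
    X = [[r[j] for j in metrics] for r, _ in rows]
    return chosen, metrics, X, labels
```

The function returns the selected project names, the indices of the retained metrics, and the filtered training set, which could then be passed to any classifier. In practice the metrics would likely need to be normalized before computing distances, since software metrics often differ widely in scale.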

Similar Papers

Improving Cross-Project Defect Prediction with Weighted Software Modules via Transfer Learning
Yizhou Chen ... Heng Dai
Journal of Physics: Conference Series | VOL. 2025
01 Sep 2021

Improving Cross-Project Software Defect Prediction Method Through Transformation and Feature Selection Approach
Yahaya Zakariyau Bala ... Pathiah Abdul Samat
IEEE Access | VOL. 11
01 Jan 2023

Defect prediction by using cluster ensembles
Yanhong Yang ... Jun Yang
01 Mar 2018

Comparing Hyperparameter Optimization in Cross- and Within-Project Defect Prediction: A Case Study
Muhammed Maruf Öztürk
Arabian Journal for Science and Engineering | VOL. 44
28 Sep 2018
