Feature Subset Selection and Instance Filtering for Cross-project Defect Prediction - Classification and Ranking

Faimison Porto,Adenilso Da Silva Simao

doi:10.19153/cleiej.19.3.4

Abstract

The defect prediction models can be a good tool on organizing the project´s test resources. The models can be constructed with two main goals: 1) to classify the software parts - defective or not; or 2) to rank the most defective parts in a decreasing order. However, not all companies maintain an appropriate set of historical defect data. In this case, a company can build an appropriate dataset from known external projects - called Cross-project Defect Prediction (CPDP).The CPDP models, however, present low prediction performances due to the heterogeneity of data. Recently, Instance Filtering methods were proposed in order to reduce this heterogeneity by selecting the most similar instances from the training dataset. Originally, the similarity is calculated based on all the available dataset features (or independent variables).We propose that using only the most relevant features on the similarity calculation can result in more accurate filtered datasets and better prediction performances. In this study we extend our previous work. We analyse both prediction goals - Classification and Ranking. We present an empirical evaluation of 41 different methods by associating Instance Filtering methods with Feature Selection methods. We used 36 versions of 11 open source projects on experiments.The results show similar evidences for both prediction goals. First, the defect prediction performance of CPDP models can be improved by associating Feature Selection and Instance Filtering. Second, no evaluated method presented general better performances. Indeed, the most appropriate method can vary according to the characteristics of the project being predicted.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: CLEI Electronic Journal	Publication Date: Dec 1, 2016
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Feature Subset Selection and Instance Filtering for Cross-project Defect Prediction - Classification and Ranking

Abstract

Talk to us

Similar Papers

More From: CLEI Electronic Journal

Lead the way for us

Similar Papers

Using Bandit Algorithms for Project Selection in Cross-Project Defect Prediction
Takuya Asano ... Akito Monden
-
Takuya Asano, et. al.Takuya Asano ... Akito Monden
01 Sep 2021
01 Sep 2021

How Far We Have Progressed in the Journey? An Examination of Cross-Project Defect Prediction
Yuming Zhou ... Yangyang Zhao
ACM Transactions on Software Engineering and Methodology | VOL. 27
Yuming Zhou, et. al.Yuming Zhou ... Yangyang Zhao
31 Jan 2018
ACM Transactions on Software Engineering and Methodology | VOL. 27

Comparing Hyperparameter Optimization in Cross- and Within-Project Defect Prediction: A Case Study
Muhammed Maruf Öztürk
Arabian Journal for Science and Engineering | VOL. 44
Muhammed Maruf ÖztürkMuhammed Maruf Öztürk
28 Sep 2018
Arabian Journal for Science and Engineering | VOL. 44

Towards Privacy Preserving Cross Project Defect Prediction with Federated Learning
Hiroki Yamamoto ... Yasutaka Kamei
-
Hiroki Yamamoto, et. al.Hiroki Yamamoto ... Yasutaka Kamei
01 Mar 2023
01 Mar 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature Subset Selection and Instance Filtering for Cross-project Defect Prediction - Classification and Ranking

Abstract

Talk to us

Similar Papers

More From: CLEI Electronic Journal