SLA+: Narrowing the Difference between Data Sets in Heterogenous Cross-Project Defection Prediction

Jie Wu,Min Zhou,Yingbo Wu,Xiaoling Jiang

doi:10.1109/compsac48688.2020.00-88

Abstract

Different from existing cross-project defection prediction(CPDP) problems which assume that there is a close relation between the source data sets and the target data sets, in the heterogenous cross-project defection prediction(HCPDP) problem, the target data sets can be totally different from the source data sets. In order to narrow the difference between source data sets and target data sets, we implemented our own algorithm SLA + based on the selective learning algorithm . We select one of the multiple sources that have the highest similarity to the target data set as the source data set, and select one or more of the other source data sets that are similar to both the target data set and the source data set as an intermediate domain. We set up a bridge between the target domain and the source domain through the intermediate domain , breaking the large distribution gap for transferring knowledge between the source domain and the target domain. Besides, we achieve the purpose of dimensionality reduction by mining the potential relationship between features. We have done experiments on open source data sets, and the data sets used are all heterogeneous. The experiments prove that our method achieves comparable results compared with state-of-the-art HCPDP in most cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SLA+: Narrowing the Difference between Data Sets in Heterogenous Cross-Project Defection Prediction

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Multiple-level biomedical event trigger recognition with transfer learning
Yifei Chen
BMC Bioinformatics | VOL. 20
Yifei ChenYifei Chen
06 Sep 2019
BMC Bioinformatics | VOL. 20

Unsupervised Adaptation Across Domain Shifts by Generating Intermediate Data Representations
Raghuraman Gopalan ... Rama Chellappa
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 36
Raghuraman Gopalan, et. al.Raghuraman Gopalan ... Rama Chellappa
01 Nov 2014
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 36

A transfer learning model with multi-source domains for biomedical event trigger extraction
Yifei Chen
BMC Genomics | VOL. 22
Yifei ChenYifei Chen
07 Jan 2021
BMC Genomics | VOL. 22

A Data Transfer and Relevant Metrics Matching Based Approach for Heterogeneous Defect Prediction
Pravas Ranjan Bal ... Sandeep Kumar
IEEE Transactions on Software Engineering | VOL. 49
Pravas Ranjan Bal, et. al.Pravas Ranjan Bal ... Sandeep Kumar
01 Mar 2023
IEEE Transactions on Software Engineering | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SLA+: Narrowing the Difference between Data Sets in Heterogenous Cross-Project Defection Prediction

Abstract

Talk to us

Similar Papers