Adversarial Learning for Cross-Project Semi-Supervised Defect Prediction

Ying Sun, Fei Wu, Danlei Xing, Yanfei Sun, Juanjuan Li, Haowen Chen, Xiao-Yuan Jing

doi:10.1109/access.2020.2974527

Abstract

Cross-project defect prediction (CPDP) aims to build a prediction model on existing source projects and predict the labels of a target project. The difference in data distribution between projects makes CPDP very challenging. Moreover, most existing CPDP methods require sufficient labeled data. However, acquiring a large amount of labeled data for a new project is difficult, while obtaining unlabeled data is relatively easy. A desirable approach is to build a prediction model on both unlabeled and labeled data. CPDP in this scenario is called cross-project semi-supervised defect prediction (CSDP). Recently, generative adversarial networks have achieved impressive results owing to their strong ability to learn data distributions and discriminative representations. To effectively learn the discriminative features of data from different projects, we propose a Discriminative Adversarial Feature Learning (DAFL) approach for CSDP. DAFL consists of a feature transformer and a project discriminator, which compete with each other. The feature transformer tries to generate a feature representation that captures discriminant information and preserves the intrinsic structure inferred from both labeled and unlabeled data. The project discriminator tries to distinguish source instances from target instances on the generated representation. Experiments on 16 projects show that DAFL performs significantly better than baselines.
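The competition between a feature transformer and a project discriminator can be illustrated with a minimal sketch. This is not the authors' DAFL implementation: it assumes a linear transformer, logistic heads, labeled source instances and unlabeled target instances, and it omits the structure-preserving term the abstract mentions. All names (`init_params`, `losses`, `train_step`, `lam`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def bce(p, t, eps=1e-12):
    """Mean binary cross-entropy between probabilities p and 0/1 targets t."""
    p = np.clip(p, eps, 1.0 - eps)
    return float(-np.mean(t * np.log(p) + (1.0 - t) * np.log(1.0 - p)))

def init_params(n_features, n_hidden, seed=0):
    rng = np.random.default_rng(seed)
    return {
        "W": rng.normal(scale=0.1, size=(n_features, n_hidden)),  # feature transformer
        "u": np.zeros(n_hidden), "c": 0.0,  # defect classifier head
        "v": np.zeros(n_hidden), "b": 0.0,  # project discriminator
    }

def losses(params, Xs, ys, Xt):
    """Return (defect loss on labeled source, source-vs-target discrimination loss)."""
    Zs, Zt = Xs @ params["W"], Xt @ params["W"]
    q = sigmoid(Zs @ params["u"] + params["c"])
    d = np.concatenate([np.ones(len(Xs)), np.zeros(len(Xt))])  # 1 = source, 0 = target
    p = sigmoid(np.vstack([Zs, Zt]) @ params["v"] + params["b"])
    return bce(q, ys), bce(p, d)

def train_step(params, Xs, ys, Xt, lr=0.1, lam=0.1):
    """One alternating update: discriminator first, then transformer and classifier."""
    W, u, c, v, b = (params[k] for k in ("W", "u", "c", "v", "b"))
    X = np.vstack([Xs, Xt])
    d = np.concatenate([np.ones(len(Xs)), np.zeros(len(Xt))])

    # 1) Discriminator minimizes its own loss on the current representation.
    Z = X @ W
    err_d = sigmoid(Z @ v + b) - d
    v = v - lr * (Z.T @ err_d) / len(X)
    b = b - lr * err_d.mean()

    # 2) Transformer minimizes (defect loss - lam * discriminator loss),
    #    i.e. it tries to make source and target features hard to tell apart.
    Zs = Xs @ W
    err_c = sigmoid(Zs @ u + c) - ys
    err_d = sigmoid(Z @ v + b) - d
    grad_W_cls = Xs.T @ np.outer(err_c, u) / len(Xs)
    grad_W_dis = X.T @ np.outer(err_d, v) / len(X)
    W_new = W - lr * (grad_W_cls - lam * grad_W_dis)

    # Classifier head minimizes the defect loss on labeled source data.
    u_new = u - lr * (Zs.T @ err_c) / len(Xs)
    c_new = c - lr * err_c.mean()
    return {"W": W_new, "u": u_new, "c": c_new, "v": v, "b": b}

if __name__ == "__main__":
    rng = np.random.default_rng(42)
    Xs = rng.normal(size=(60, 4))            # synthetic source metrics
    Xt = rng.normal(loc=0.5, size=(50, 4))   # synthetic, shifted target metrics
    ys = (Xs[:, 0] > 0).astype(float)        # synthetic defect labels
    params = init_params(4, 3)
    for _ in range(50):
        params = train_step(params, Xs, ys, Xt)
    print(losses(params, Xs, ys, Xt))
```

The weight `lam` controls how strongly the transformer is pushed to confuse the discriminator. Setting it to zero reduces the sketch to ordinary supervised training on the source project.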

Highlights

Software defect prediction (SDP) [1]–[8] is an important software quality assurance step of predicting defect-proneness in software project development history.
When sufficient historical data are not available, cross-project defect prediction (CPDP) [23] is a satisfactory solution: a prediction model is trained on data from source projects and used to predict the labels of a target project.
To address the challenges of distribution differences between projects and the limited amount of labeled data, we propose a new approach, termed Discriminative Adversarial Feature Learning (DAFL), for cross-project semi-supervised defect prediction (CSDP).

Summary

Introduction

Software defect prediction (SDP) [1]–[8] is an important software quality assurance step of predicting defect-proneness in software project development history. Many prior SDP studies predict the faults of a new instance within the same project, which is called within-project defect prediction (WPDP) [9]–[13]. Studies have shown that a useful machine learning model needs to be trained on sufficient and complete data. It is challenging to obtain a well-performing prediction model for a new project with limited historical data. When sufficient historical data are not available, cross-project defect prediction (CPDP) [23] is a satisfactory solution: a prediction model is trained on data from source projects and used to predict the labels of a target project.

The associate editor coordinating the review of this manuscript and approving it for publication was Zhaojun Li.



Journal: IEEE Access
Publication Date: Jan 1, 2020
Citations: 13
License type: CC BY 4.0


Similar Papers

A Suitable AST Node Granularity and Multi-Kernel Transfer Convolutional Neural Network for Cross-Project Defect Prediction
Jiehan Deng ... Lu Lu
IEEE Access | VOL. 8
01 Jan 2020

Cross-project defect prediction based on G-LSTM model
Ying Xing ... Yu Guan
Pattern Recognition Letters | VOL. 160
01 Aug 2022

Cross-Project Defect Prediction Based on Domain Adaptation and LSTM Optimization
Khadija Javed ... Ren Shengbing
Algorithms | VOL. 17
24 Apr 2024

Cross-Project Defect Prediction with Metrics Selection and Balancing Approach
Meetesh Nevendra ... Pradeep Singh
Applied Computer Systems | VOL. 27
01 Dec 2022
