Few-Shot Learning Based Balanced Distribution Adaptation for Heterogeneous Defect Prediction

Aili Wang, Haibin Wu, Kaiyuan Jiang, Minhui Wang, Yutong Zhang

doi:10.1109/access.2020.2973924

Open Access

https://doi.org/10.1109/access.2020.2973924

Journal: IEEE Access
Publication Date: Jan 1, 2020
Citations: 15
License type: CC BY 4.0

Affiliation: Harbin University of Science and Technology

Abstract

Heterogeneous defect prediction (HDP) aims to predict the defect tendency of modules in one project using heterogeneous data collected from other projects. It has to cope with two characteristics of defect prediction data: (1) datasets can have different metrics and distributions, and (2) data can be highly imbalanced. In this paper, we propose a few-shot learning based balanced distribution adaptation (FSLBDA) approach for heterogeneous defect prediction that takes both characteristics into consideration. Class imbalance in the defect datasets can be addressed with undersampling, but this shrinks the training datasets. Specifically, we first remove redundant metrics from the datasets with extreme gradient boosting. Then, we reduce the difference between the source domain and the target domain with balanced distribution adaptation, which considers both the marginal and the conditional distribution differences and adaptively assigns different weights to them. Finally, we use adaptive boosting to mitigate the effect of the smaller training dataset, which can improve the accuracy of the defect prediction model. We conduct experiments on 17 projects from 4 datasets using 3 indicators (i.e., AUC, G-mean, F-measure). Compared with three classic approaches, the experimental results show that FSLBDA can effectively improve the prediction performance.
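The three indicators named in the abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions (G-mean as the geometric mean of recall and specificity, F-measure as the harmonic mean of precision and recall, AUC as the probability that a defective module is ranked above a clean one); the excerpt does not give the paper's exact formulas, and the function names are hypothetical.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = defective, 0 = clean)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn


def g_mean(y_true, y_pred):
    """Geometric mean of recall and specificity (assumed definition)."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    recall = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return math.sqrt(recall * specificity)


def f_measure(y_true, y_pred):
    """Harmonic mean of precision and recall (assumed definition)."""
    tp, _, fp, fn = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def auc(y_true, scores):
    """Fraction of (defective, clean) pairs ranked correctly; ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one module of each class")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

G-mean and AUC are often preferred on imbalanced defect data because, unlike plain accuracy, they are not dominated by the majority (clean) class.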

Highlights

With the availability of massive storage capabilities, high-speed Internet, and the advent of Internet of Things devices, modern software systems are growing in both size and complexity [1].
Wang et al. proposed a balanced distribution adaptation (BDA) method [20], which can dynamically measure the different effects of the marginal distribution and the conditional distribution, rather than giving them the same weight.
Conclusions and future work: In this paper, we introduce BDA to dynamically narrow the marginal and conditional distribution differences between heterogeneous datasets using the balance factor.
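The role of the balance factor can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' implementation: it assumes source and target samples have already been mapped into a common feature space, uses the squared distance between feature means (a linear-kernel MMD) as the discrepancy measure, and stands in pseudo-labels for the unknown target labels. The names `linear_mmd`, `balanced_distance`, and `mu` are hypothetical.

```python
def mean_vector(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def linear_mmd(a, b):
    """Squared Euclidean distance between the feature means of two samples."""
    mean_a, mean_b = mean_vector(a), mean_vector(b)
    return sum((x - y) ** 2 for x, y in zip(mean_a, mean_b))


def balanced_distance(xs, ys, xt, yt_pseudo, mu):
    """Weighted sum of marginal and per-class (conditional) discrepancies.

    mu = 0 uses only the marginal term; mu = 1 uses only the conditional term.
    yt_pseudo holds assumed pseudo-labels for the target samples.
    """
    if not 0.0 <= mu <= 1.0:
        raise ValueError("mu must lie in [0, 1]")
    marginal = linear_mmd(xs, xt)
    conditional = 0.0
    for c in sorted(set(ys)):
        xs_c = [x for x, y in zip(xs, ys) if y == c]
        xt_c = [x for x, y in zip(xt, yt_pseudo) if y == c]
        if xs_c and xt_c:  # skip classes missing on either side
            conditional += linear_mmd(xs_c, xt_c)
    return (1.0 - mu) * marginal + mu * conditional
```

Giving both terms the same fixed weight would correspond to a single constant value of `mu`; letting it vary is what allows the two discrepancies to be weighted differently per dataset pair.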

Summary

Introduction

With the availability of massive storage capabilities, high-speed Internet, and the advent of Internet of Things devices, modern software systems are growing in both size and complexity [1]. Subspace learning can reduce data drift during data mapping, but the marginal and conditional distributions of the source domain and the target domain still differ, which affects the decision result. When the source domain and target domain data differ greatly, marginal distribution adaptation is more important.
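The trade-off between the two kinds of adaptation can be written compactly. The expression below follows the general form of balanced distribution adaptation; the symbol \mu for the balance factor and the notation for the domains are assumptions, since this excerpt does not reproduce the paper's equations.

```latex
\[
D(\mathcal{D}_s, \mathcal{D}_t) \approx (1 - \mu)\, D\big(P(x_s), P(x_t)\big)
  + \mu\, D\big(P(y_s \mid x_s), P(y_t \mid x_t)\big),
\qquad \mu \in [0, 1]
\]
```

As \mu approaches 0, the marginal term dominates, which matches the case where the two domains differ greatly; as \mu approaches 1, the conditional term dominates.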

Similar Papers

Multi-Source Heterogeneous Kernel Mapping in Software Defect Prediction
Jingxiu Yao ... Zhibo Li
Applied Sciences | VOL. 13 | 28 Apr 2023

A Data Transfer and Relevant Metrics Matching Based Approach for Heterogeneous Defect Prediction
Pravas Ranjan Bal ... Sandeep Kumar
IEEE Transactions on Software Engineering | VOL. 49 | 01 Mar 2023

Heterogeneous Defect Prediction Through Multiple Kernel Learning and Ensemble Learning
Zhiqiang Li ... Xiao-Yuan Jing
01 Sep 2017

Heterogeneous Defect Prediction through Joint Metric Selection and Matching
Haowen Chen ... Xiao-Yuan Jing
01 Dec 2021
