An Empirical Examination of the Impact of Bias on Just-in-time Defect Prediction

Jiri Gesi,Iftekhar Ahmed,Jiawei Li

doi:10.1145/3475716.3475791

Abstract

Background: Just-In-Time (JIT) defect prediction models predict if a commit will introduce defects in the future. DeepJIT and CC2Vec are two state-of-the-art JIT Deep Learning (DL) techniques. Usually, defect prediction techniques are evaluated, treating all training data equally. However, data is usually imbalanced not only in terms of the overall class label (e.g., defect and non-defect) but also in terms of characteristics such as File Count, Edit Count, Multiline Comments, Inward Dependency Sum etc. Prior research has investigated the impact of class imbalance on prediction technique's performance but not the impact of imbalance of other characteristics. Aims: We aim to explore the impact of different commit related characteristic's imbalance on DL defect prediction. Method: We investigated different characteristic's impact on the overall performance of DeepJIT and CC2Vec. We also propose a Siamese network based few-shot learning framework for JIT defect prediction (SifterJIT) combining Siamese network and DeepJIT. Results: Our results show that DeepJIT and CC2Vec lose out on the performance by around 20% when trained and tested on imbalanced data. However, SifterJIT can outperform state-of-the-art DL techniques with an average of 8.65% AUC score, 11% precision, and 6% F1-score improvement. Conclusions: Our results highlight that dataset imbalanced in terms of commit characteristics can significantly impact prediction performance, and few-shot learning based techniques can help alleviate the situation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Empirical Examination of the Impact of Bias on Just-in-time Defect Prediction

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Deep Learning for Software Defect Prediction: An LSTM-based Approach
Et Al Prashant Sahatiya
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11
Et Al Prashant SahatiyaEt Al Prashant Sahatiya
05 Nov 2023
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11

Deep Learning for Software Defect Prediction: An LSTM-based Approach
Et Al Prashant Sahatiya
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11
Et Al Prashant SahatiyaEt Al Prashant Sahatiya
05 Nov 2023
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11

COVID‐19: A systematic review of prediction and classification techniques
Om Ramakisan Varma ... Mala Kalra
International Journal of Imaging Systems and Technology | VOL. 33
Om Ramakisan Varma, et. al.Om Ramakisan Varma ... Mala Kalra
11 May 2023
International Journal of Imaging Systems and Technology | VOL. 33

Early Diagnosis of Brain Tumour MRI Images Using Hybrid Techniques between Deep and Machine Learning.
Ebrahim Mohammed Senan ... Mukti E Jadhav
Computational and Mathematical Methods in Medicine | VOL. 2022
Ebrahim Mohammed Senan, et. al.Ebrahim Mohammed Senan ... Mukti E Jadhav
18 May 2022
Computational and Mathematical Methods in Medicine | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Empirical Examination of the Impact of Bias on Just-in-time Defect Prediction

Abstract

Talk to us

Similar Papers