Interpretability application of the Just-in-Time software defect prediction model

Wei Zheng,Tianren Shen,Xiang Chen,Peiran Deng

doi:10.1016/j.jss.2022.111245

Abstract

Software defect prediction is one of the most active fields in software engineering. Recently, some experts have proposed the Just-in-time Defect Prediction Technology. Just-in-time Defect prediction technology has become a hot topic in defect prediction due to its directness and fine granularity. This technique can predict whether a software defect exists in every code change submitted by a developer. In addition, the method has the advantages of high speed and easy tracking. However, the biggest challenge is that the prediction accuracy of Just-in-Time software is affected by the data set category imbalance. In most cases, 20% of defects in software engineering may be in 80% of modules, and code changes that do not cause defects account for a large proportion. Therefore, there is an imbalance in the data set, that is, the imbalance between a few classes and a majority of classes, which will affect the classification prediction effect of the model. Furthermore, because most features do not result in code changes that cause defects, it is not easy to achieve the desired results in practice even though the model is highly predictive. In addition, the features of the data set contain many irrelevant features and redundant features, which are invalid data, which will increase the complexity of the prediction model and reduce the prediction efficiency. To improve the prediction efficiency of Just-in-Time defect prediction technology. We trained a just-in-time defect prediction model using six open source projects from different fields based on random forest classification. LIME Interpretability technique is used to explain the model to a certain extent. By using explicable methods to extract meaningful, relevant features, the experiment can only need 45% of the original work to explain the prediction results of the prediction model and identify critical features through explicable techniques, and only need 96% of the original work to achieve this goal, under the premise of ensuring specific prediction effects. Therefore, the application of interpretable techniques can significantly reduce the workload of developers and improve work efficiency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Interpretability application of the Just-in-Time software defect prediction model

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Software

Lead the way for us

Journal: Journal of Systems and Software	Publication Date: Feb 3, 2022
Citations: 50

Similar Papers

Just-in-Time Defect Prediction Technology based on Interpretability Technology
Wei Zheng ... Tianren Shen
-
Wei Zheng, et. al.Wei Zheng ... Tianren Shen
01 Aug 2021
01 Aug 2021

Is Open-Source Software Valuable for Software Defect Prediction of Proprietary Software and Vice Versa?
Misha Kakkar ... P S Grover
-
Misha Kakkar, et. al.Misha Kakkar ... P S Grover
25 Nov 2017
25 Nov 2017

Towards a framework for reliable performance evaluation in defect prediction
Xutong Liu ... Yuming Zhou
Science of Computer Programming | VOL. 238
Xutong Liu, et. al.Xutong Liu ... Yuming Zhou
12 Jun 2024
Science of Computer Programming | VOL. 238

The Empirical Study of Semi-Supervised Deep Fuzzy C-Mean Clustering for Software Fault Prediction
Ali Arshad ... Licheng Jiao
IEEE Access | VOL. 6
Ali Arshad, et. al.Ali Arshad ... Licheng Jiao
01 Jan 2018
IEEE Access | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Interpretability application of the Just-in-Time software defect prediction model

Abstract

Talk to us

Similar Papers

More From: Journal of Systems and Software