Abstract

Correlated quality metrics extracted from a source code repository can be utilized to design a model to automatically predict defects in a software system. It is obvious that the extracted metrics will result in a highly unbalanced data, since the number of defects in a good quality software system should be far less than the number of normal instances. It is also a fact that the selection of the best discriminating features significantly improves the robustness and accuracy of a prediction model. Therefore, the contribution of this paper is twofold, first it selects the best discriminating features that help in accurately predicting a defect in a software component. Secondly, a cost-sensitive logistic regression and decision tree ensemble-based prediction models are applied to the best discriminating features for precisely predicting a defect in a software component. The proposed models are compared with the most recent schemes in the literature in terms of accuracy, area under the curve, and recall. The models are evaluated using 11 datasets and it is evident from the results and analysis that the performance of the proposed prediction models outperforms the schemes in the literature.

Highlights

  • The development of high performance and efficient software systems is increasing day by day

  • We propose best features-based cost-sensitive logistic regression (CLR) and decision trees ensemble (CDTE) models to predict software defects

  • The results show that the proposed technique improves the performance by minimizing the root mean squared error up to 50% and improves the stability of the prediction model

Read more

Summary

Introduction

The development of high performance and efficient software systems is increasing day by day. This efficiency is achieved at the cost of software complexity. Analysing these complex software systems manually is a difficult, tedious and costly process Rathore and Kumar (2017). To overcome this difficulty automatic software defect prediction can play a vital role. Extended author information available on the last page of the article. Automated Software Engineering (2021) 28:11 defect prediction of large and complex systems is a challenging task, it can be accomplished by utilising correlated quality metrics.

Objectives
Methods
Findings
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.