Software Fault Prediction Using Optimal Classifier Selection: An Ensemble Approach

Bikash Agrawalla, B Ramachandra Reddy

doi:10.1016/j.procs.2024.04.280

Abstract

Fault prediction is the process of using data analysis and machine learning models to anticipate potential defects or faults in a software system. Relying only on base machine learning models for software fault prediction leads to limited performance, difficulty handling non-linear relationships and imbalanced data, inadequate feature representation, and a limited ability to handle complexity. To overcome these challenges, this paper proposes a new technique for selecting the classifiers that form a heterogeneous ensemble. The main goal is to trim the classifiers that perform poorly relative to the other base classifiers, which can result in a more effective ensemble and better results. The proposed algorithm finds a subset of classifiers that can perform better than the full set. The main challenge is identifying the poor-performing classifiers; it is addressed by experimenting with different threshold values to choose the trimmed set of classifiers. The proposed model is evaluated on 8 benchmark software fault datasets taken from the PROMISE and Apache repositories, with AUC as the performance measure. The experimental results demonstrate the effectiveness of the algorithm compared to the traditional approach of using all the base classifiers. AUC increases significantly for 6 of the 8 datasets when using the average of probabilities, and for 7 of the 8 datasets when using majority voting. With the average of probabilities, the best-performing dataset is ARC, where AUC increases from 0.6505 to 0.694; with majority voting, the best-performing dataset is XALAN, where AUC increases from 0.5455 to 0.679. These results show that the proposed ensemble approach achieves higher AUC values on the tested datasets than the base machine learning classifiers.
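The trimming idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state what quantity the threshold is applied to, so this example assumes a threshold on each classifier's AUC; the classifier names, the toy probabilities, the threshold of 0.6, and all function names are hypothetical and are not taken from the paper. For brevity the same toy data is used both to score the classifiers and to combine them, whereas a real experiment would score them on held-out data.

```python
from statistics import mean


def auc(labels, scores):
    """AUC via pairwise comparison of faulty vs. non-faulty scores (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need both faulty and non-faulty examples")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def trim_classifiers(classifier_auc, threshold):
    """Assumed rule: keep classifiers whose AUC is at least `threshold`."""
    kept = [name for name, a in classifier_auc.items() if a >= threshold]
    # Fall back to the single best classifier if the threshold removes everything.
    return kept or [max(classifier_auc, key=classifier_auc.get)]


def average_of_probabilities(probs, kept):
    """Mean predicted fault probability per module over the kept classifiers."""
    n = len(next(iter(probs.values())))
    return [mean(probs[name][i] for name in kept) for i in range(n)]


def majority_vote(probs, kept, cutoff=0.5):
    """Hard label per module: 1 if a strict majority of kept classifiers vote faulty."""
    n = len(next(iter(probs.values())))
    votes = [sum(probs[name][i] >= cutoff for name in kept) for i in range(n)]
    return [1 if 2 * v > len(kept) else 0 for v in votes]


# Toy data: 1 = faulty module, 0 = non-faulty module (hypothetical values).
labels = [1, 1, 1, 0, 0, 0]
probs = {
    "logistic": [0.9, 0.7, 0.6, 0.4, 0.3, 0.2],
    "tree": [0.8, 0.6, 0.4, 0.5, 0.3, 0.1],
    "weak": [0.2, 0.3, 0.9, 0.8, 0.7, 0.1],
}

classifier_auc = {name: auc(labels, p) for name, p in probs.items()}
kept = trim_classifiers(classifier_auc, threshold=0.6)

full_avg_auc = auc(labels, average_of_probabilities(probs, list(probs)))
trimmed_avg_auc = auc(labels, average_of_probabilities(probs, kept))
trimmed_vote_auc = auc(labels, majority_vote(probs, kept))

print("kept classifiers:", kept)
print("average of probabilities, all vs. trimmed:", full_avg_auc, trimmed_avg_auc)
print("majority voting, trimmed:", trimmed_vote_auc)
```

In this toy setting the weakest classifier is dropped and the averaged-probability AUC of the trimmed set is higher than that of the full set. Repeating the selection over several threshold values, as the abstract describes, would amount to calling `trim_classifiers` in a loop and comparing the resulting AUC values.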

Journal: Procedia Computer Science
Publication Date: Jan 1, 2024
License type: CC BY-NC-ND
