An Improved and Optimized Random Forest Based Approach to Predict the Software Faults

Nikhil Saji Thomas, S Kaliraj

doi:10.1007/s42979-024-02764-x

Abstract

Effective software fault prediction is crucial for minimizing errors during software development and preventing subsequent failures. This research introduces an enhanced Random Forest-based approach for predicting software faults and evaluates it on the NASA JM1 dataset, which comprises 21 software metrics used to indicate the presence or absence of faults in a module. The study examines the NASA dataset in detail, describing the cleaning process and addressing class imbalance with the Synthetic Minority Over-sampling Technique (SMOTE). The core of our approach is the implementation and fine-tuning of the Random Forest classifier, with a specific focus on optimizing hyperparameters to improve predictive accuracy. In comparative evaluations against standard machine learning models, our proposed approach demonstrated superior performance, achieving an accuracy of 82.96% and an F1 score of 89.53%. We also emphasize the significance of software defects, which can cause failures and crashes during software development and lead to substantial organizational losses. The paper examines different aspects of the machine learning model and offers detailed insights, examples, and illustrative figures to aid understanding of the proposed approach.
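The abstract names SMOTE as the class-balancing step but does not give implementation details. The following is a minimal sketch of the general SMOTE idea (interpolating between a minority sample and one of its nearest minority neighbours), written in plain Python for illustration. The function names `smote` and `balance`, the default `k=5`, and the choice to oversample until the classes are equal in size are assumptions of this sketch, not details taken from the paper.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic rows interpolated between minority-class neighbours.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    if n_new <= 0:
        return []
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def balance(X, y, minority_label=1, k=5, seed=0):
    """Oversample the minority class until both classes have equal counts (assumed target)."""
    minority = [x for x, label in zip(X, y) if label == minority_label]
    n_majority = len(y) - len(minority)
    new_rows = smote(minority, n_majority - len(minority), k=k, seed=seed)
    # Build new lists so the caller's X and y are left unmodified.
    return X + new_rows, y + [minority_label] * len(new_rows)
```

In practice a maintained implementation such as the `SMOTE` class in the imbalanced-learn package would normally be used instead of a hand-written one, and oversampling is usually applied only to the training split so that synthetic rows do not leak into the evaluation data.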

Journal: SN Computer Science
Publication Date: May 9, 2024
License type: CC BY 4.0

Similar Papers

A novel approach for software defect prediction using CNN and GRU based on SMOTE Tomek method
Nasraldeen Alnor Adam Khleel ... Károly Nehéz
Journal of Intelligent Information Systems | VOL. 60
16 May 2023

Hybrid Sampling and Random Forest Based Machine Learning Approach for Software Defect Prediction
Md Anwar Hossen ... Nurhafizah Abu Talip Yusof
01 Jan 2020

Software Defect Prediction Based on Optimized Machine Learning Models: A Comparative Study
Muhammad Zain Fawwaz Nuruddin Siswantoro ... Umi Laili Yuhana
Teknika | VOL. 12
30 Jun 2023

Software Fault Dataset
Sandeep Kumar ... Santosh Singh Rathore
01 Jan 2018
