Software Defect Datasets Research Articles

Purpose

Software defect prediction (SDP) is a critical aspect of software quality assurance, aiming to identify and manage potential defects in software systems. In this paper, we propose a novel hybrid approach that combines Grey Wolf Optimization with Feature Selection (GWOFS) and a multilayer perceptron (MLP) for SDP. The GWOFS-MLP hybrid model is designed to optimize feature selection, ultimately enhancing the accuracy and efficiency of SDP. Grey Wolf Optimization, inspired by the social hierarchy and hunting behavior of grey wolves, is employed to select a subset of relevant features from an extensive pool of potential predictors. This study investigates the key challenges that traditional SDP approaches encounter and proposes solutions to overcome time complexity and the curse of dimensionality.

Design/methodology/approach

The integration of GWOFS and MLP results in a robust hybrid model that can adapt to diverse software datasets. The feature selection process harnesses the cooperative hunting behavior of wolves, allowing for the exploration of critical feature combinations. The selected features are then fed into an MLP, an artificial neural network (ANN) known for its capability to learn intricate patterns within software metrics. The MLP serves as the predictive engine, using the curated feature set to model and classify software defects.

Findings

The performance evaluation of the GWOFS-MLP hybrid model on a real-world software defect dataset demonstrates its effectiveness. The model achieves a training accuracy of 97.69% and a testing accuracy of 97.99%. Additionally, the receiver operating characteristic area under the curve (ROC-AUC) score of 0.89 highlights the model's ability to discriminate between defective and defect-free software components. The confusion matrix further illustrates the model's performance, with only a small number of false positives and false negatives.

Originality/value

Experimental implementations using machine learning-based techniques with feature reduction are conducted to validate the proposed solutions. The goal is to enhance SDP's accuracy, relevance and efficiency, ultimately improving software quality assurance processes.
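The feature selection step described in the abstract can be illustrated with a minimal sketch of a binary Grey Wolf Optimizer. This is not the authors' implementation: the function names `fitness` and `gwo_select`, the 0.5 threshold used to turn wolf positions into feature masks, the weight `w`, and the nearest-centroid classifier standing in for the MLP inside the fitness function are all assumptions made to keep the example short and dependent only on NumPy. Fitness is also scored on the same data used for the search, which a real study would replace with a held-out split.

```python
import numpy as np

def fitness(mask, X, y, w=0.99):
    """Score a boolean feature mask (hypothetical objective).

    Combines the accuracy of a nearest-centroid classifier (a cheap
    stand-in for the MLP) with a small reward for using fewer features.
    """
    if not mask.any():
        return 0.0
    Xs = X[:, mask]
    c0 = Xs[y == 0].mean(axis=0)  # centroid of defect-free samples
    c1 = Xs[y == 1].mean(axis=0)  # centroid of defective samples
    d0 = np.linalg.norm(Xs - c0, axis=1)
    d1 = np.linalg.norm(Xs - c1, axis=1)
    acc = ((d1 < d0).astype(int) == y).mean()
    return w * acc + (1 - w) * (1 - mask.sum() / mask.size)

def gwo_select(X, y, n_wolves=8, n_iter=30, seed=0):
    """Return the best boolean feature mask found by a simple GWO search."""
    rng = np.random.default_rng(seed)
    pos = rng.random((n_wolves, X.shape[1]))  # continuous positions in [0, 1]
    best_mask, best_score = None, -1.0
    for t in range(n_iter + 1):
        # A feature counts as selected when its coordinate exceeds 0.5 (assumption).
        scores = np.array([fitness(p > 0.5, X, y) for p in pos])
        order = np.argsort(scores)[::-1]
        if scores[order[0]] > best_score:
            best_score = scores[order[0]]
            best_mask = pos[order[0]] > 0.5
        if t == n_iter:
            break
        # The three best wolves (alpha, beta, delta) lead the pack.
        alpha, beta, delta = pos[order[:3]]
        a = 2.0 * (1 - t / n_iter)  # decreases linearly from 2 to 0
        new_pos = np.zeros_like(pos)
        for leader in (alpha, beta, delta):
            A = a * (2 * rng.random(pos.shape) - 1)
            C = 2 * rng.random(pos.shape)
            new_pos += leader - A * np.abs(C * leader - pos)
        # Each wolf moves to the average of the three leader-guided positions.
        pos = np.clip(new_pos / 3, 0.0, 1.0)
    return best_mask

# Synthetic demo data: only the first two of ten features carry signal.
demo_rng = np.random.default_rng(1)
X_demo = demo_rng.normal(size=(200, 10))
y_demo = (X_demo[:, 0] + X_demo[:, 1] > 0).astype(int)
selected = gwo_select(X_demo, y_demo)
print("selected feature indices:", np.flatnonzero(selected))
```

In a pipeline like the one the abstract describes, the columns picked by the returned mask would then be passed to an MLP classifier, and the fitness function would typically score candidate masks with that classifier on validation data.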

Articles published on Software Defect Datasets

Bug numbers matter: An empirical study of effort‐aware defect prediction using class labels versus bug numbers

Hybrid feature selection method for predicting software defect

Understanding the Impact of Changes in Application Characteristics on SRGM

A hybrid approach for optimizing software defect prediction using a grey wolf optimization and multilayer perceptron

Software Fault Prediction Using Optimal Classifier Selection: An Ensemble Approach

Leveraging Ensemble Learning with Generative Adversarial Networks for Imbalanced Software Defects Prediction

A software defect prediction method based on learnable three-line hybrid feature fusion

Privacy Protection Optimization for Federated Software Defect Prediction via Benchmark Analysis

Improving Software Defect Prediction in Noisy Imbalanced Datasets

A Hybrid Software Defects Prediction Model for Imbalance Datasets Using Machine Learning Techniques: (S-SVM Model)

Class Imbalance Reduction and Centroid based Relevant Project Selection for Cross Project Defect Prediction

Concept Drift in Software Defect Prediction: A Method for Detecting and Handling the Drift

Efficient Random Forest Algorithm for Multi-objective Optimization in Software Defect Prediction

A GABP based method to improved software defect prediction

Improving Cross-Project Software Defect Prediction Method Through Transformation and Feature Selection Approach

Data and Ensemble Machine Learning Fusion Based Intelligent Software Defect Prediction System

Cloud-based bug tracking software defects analysis using deep learning

A Survey of Different Approaches for the Class Imbalance Problem in Software Defect Prediction

Generative Oversampling Methods for Handling Imbalanced Data in Software Fault Prediction

Software defect prediction based on nested-stacking and heterogeneous feature selection
