Quality Of Software Projects Research Articles

Abstract Software bug prediction (SBP) involves identifying or categorizing software modules likely to contain defects, utilizing underlying system properties such as software metrics. SBP plays a crucial role in enhancing software project quality and mitigating maintenance risks. Numerous machine learning (ML) algorithms have been developed to predict software bugs. Class imbalance poses a significant challenge for these algorithms, significantly impeding their effectiveness and resulting in imbalanced false-positive and false-negative outcomes. However, limited research has been conducted to specifically tackle the issue of class imbalance in the context of SBP. This study investigates the prediction performance of a homogeneous ensemble: Bagging, boosting, and voting classifiers (VC) methods combined with the under-sampling methods to address the class imbalance problem and improve the accuracy of SBP. Two ensembles are classified as bagging ensembles: decision tree (DT) and random forest (RF); two ensembles are classified as boosting ensembles: AdaBoost (AB) and gradient boosting (GB), while the DT, RF, K-Nearest Neighbours (K-NN), and support vector machine (SVM) are considered as VC. To establish the effectiveness of the proposed models, the experiments were conducted on the available benchmark datasets, which comprise five public datasets based on both class and file-level metrics. We compared and evaluated the performance of the proposed models according to several performance measures, namely accuracy, precision, recall, f-measure, Matthew’s correlation coefficient (MCC), and the area under the receiver operating characteristic curve (AUROC). The experimental findings demonstrated that the proposed models exhibit superior efficiency in predicting software bugs on balanced datasets compared to the original datasets, with an improvement of up to 11% accuracy for the class-level metrics and 10% for the file-level metrics. The results indicate that the use of data sampling techniques had a positive impact on the prediction accuracy of the presented models. We compared our proposed method with existing SBP methods based on several standard performance measures. The comparison outcomes revealed a significant superiority of our method over the prevailing state-of-the-art SBP methods across most datasets.

Read full abstract

Software defect prediction (SDP) plays a vital role in enhancing the quality of software projects and reducing maintenance-based risks through the ability to detect defective software components. SDP refers to using historical defect data to construct a relationship between software metrics and defects via diverse methodologies. Several prediction models, such as machine learning (ML) and deep learning (DL), have been developed and adopted to recognize software module defects, and many methodologies and frameworks have been presented. Class imbalance is one of the most challenging problems these models face in binary classification. However, When the distribution of classes is imbalanced, the accuracy may be high, but the models cannot recognize data instances in the minority class, leading to weak classifications. So far, little research has been done in the previous studies that address the problem of class imbalance in SDP. In this study, the data sampling method is introduced to address the class imbalance problem and improve the performance of ML models in SDP. The proposed approach is based on a convolutional neural network (CNN) and gated recurrent unit (GRU) combined with a synthetic minority oversampling technique plus the Tomek link (SMOTE Tomek) to predict software defects. To establish the efficiency of the proposed models, the experiments have been conducted on benchmark datasets obtained from the PROMISE repository. The experimental results have been compared and evaluated in terms of accuracy, precision, recall, F-measure, Matthew’s correlation coefficient (MCC), the area under the ROC curve (AUC), the area under the precision-recall curve (AUCPR), and mean square error (MSE). The experimental results showed that the proposed models predict the software defects more effectively on the balanced datasets than the original datasets, with an improvement of up to 19% for the CNN model and 24% for the GRU model in terms of AUC. We compared our proposed approach with existing SDP approaches based on several standard performance measures. The comparison results demonstrated that the proposed approach significantly outperforms existing state-of-the-art SDP approaches on most datasets.

Read full abstract

Quality Of Software Projects Research Articles

Related Topics

Articles published on Quality Of Software Projects

Ensemble-Based Machine Learning Algorithms Combined with Near Miss Method for Software Bug Prediction

Graph-Driven Exploration of Issue Handling Schemes in Software Projects

Detecting and resolving feature envy through automated machine learning and move method refactoring

AI-powered peer review process

A novel approach for software defect prediction using CNN and GRU based on SMOTE Tomek method

Factors Impacting Defect Density in Software Development Projects

An efficient heuristic algorithm for software module clustering optimization

Expert assessment of quality innovative software projects: choice of integrated index

Novel similarity measures, entropy of intuitionistic fuzzy sets and their application in software quality evaluation

Reliable Requirements Engineering Practices for COVID-19 Using Blockchain

Enhancement of the Capability Maturity Model for Improving the Quality of Software Projects in Developing Countries

Impact of time pressure on software quality: A laboratory experiment on a game-theoretical model.

Quality Assessment of Standard and Customized COTS Products

Cross-projects software defect prediction using spotted hyena optimizer algorithm

QUALITY OF SOFTWARE PROJECTS – A CASE STUDY

A Change Recommendation Approach Using Change Patterns of a Corresponding Test File

A complexity metric for object-oriented software

Soft skills requirements in mobile applications development employment market

Grouping environmental factors influencing individual decision‐making behavior in software projects: A cluster analysis

A Hybrid eBusiness Software Metrics Framework for Decision Making in Cloud Computing Environment

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Quality Of Software Projects Research Articles

Related Topics

Articles published on Quality Of Software Projects

Ensemble-Based Machine Learning Algorithms Combined with Near Miss Method for Software Bug Prediction

Graph-Driven Exploration of Issue Handling Schemes in Software Projects

Detecting and resolving feature envy through automated machine learning and move method refactoring

AI-powered peer review process

A novel approach for software defect prediction using CNN and GRU based on SMOTE Tomek method

Factors Impacting Defect Density in Software Development Projects

An efficient heuristic algorithm for software module clustering optimization

Expert assessment of quality innovative software projects: choice of integrated index

Novel similarity measures, entropy of intuitionistic fuzzy sets and their application in software quality evaluation

Reliable Requirements Engineering Practices for COVID-19 Using Blockchain

Enhancement of the Capability Maturity Model for Improving the Quality of Software Projects in Developing Countries

Impact of time pressure on software quality: A laboratory experiment on a game-theoretical model.

Quality Assessment of Standard and Customized COTS Products

Cross-projects software defect prediction using spotted hyena optimizer algorithm

QUALITY OF SOFTWARE PROJECTS – A CASE STUDY

A Change Recommendation Approach Using Change Patterns of a Corresponding Test File

A complexity metric for object-oriented software

Soft skills requirements in mobile applications development employment market

Grouping environmental factors influencing individual decision‐making behavior in software projects: A cluster analysis

A Hybrid eBusiness Software Metrics Framework for Decision Making in Cloud Computing Environment