Software Project Dataset Research Articles

Software effort estimation is a critical task in software project management. Unfortunately, uncertainty and inaccuracy are inherent properties of the software effort estimation environment. They are caused by managers' limited ability to foresee, measure, and describe the factors that influence software effort. The promising Fuzzy Analogy-based Software Effort Estimation model (FASEE) successfully employs fuzzy logic with approximate reasoning theory to handle imprecision and to reason under uncertainty. FASEE also uses a possibility distribution to quantify the uncertainty in the estimate, which helps software managers assess risks. Yet FASEE suffers from low data quality and from the uncertainty induced in the reasoning process. In this paper, we propose an enhancement of FASEE that imposes consistency criteria to deal with these drawbacks. The resulting model, called Consistent Fuzzy Analogy-based Software Effort Estimation (C-FASEE), is endowed with two capabilities. The first introduces consistency criteria in the fuzzy-set representation of attributes, so that each attribute can be fitted to the software effort. The second introduces a new confidence relation that measures the extent to which the retrieved most similar projects respect the assumption “similar projects have similar efforts”. Moreover, the C-FASEE method provides a fuzzy estimate: the most possible fuzzy set in which the true effort of the new software project falls. This allows the software manager to assess risks more effectively. The proposed C-FASEE is validated on thirteen software project datasets of differing complexity. The results are compared with variant methods of the analogy-based software effort estimation approach. The experimental results show that our proposal provides good estimation accuracy and performs significantly better than the comparison methods.
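For orientation, the classical (non-fuzzy) analogy-based estimation that FASEE and C-FASEE build on can be sketched in a few lines. This is only an illustration of the assumption “similar projects have similar efforts”. It does not reproduce the fuzzy-set representation, the possibility distribution, or the confidence relation described in the abstract. The function name, the two features, and all data values are hypothetical.

```python
import math


def estimate_effort_by_analogy(history, new_project, k=2):
    """Return the mean effort of the k historical projects closest to new_project.

    history is a list of (features, effort) pairs; features are numeric tuples.
    """
    n_features = len(new_project)
    # Min-max bounds per feature, taken from the historical projects only.
    lows = [min(f[i] for f, _ in history) for i in range(n_features)]
    highs = [max(f[i] for f, _ in history) for i in range(n_features)]

    def scale(features):
        # Rescale each feature so that no single unit dominates the distance.
        return [
            (x - lo) / (hi - lo) if hi > lo else 0.0
            for x, lo, hi in zip(features, lows, highs)
        ]

    target = scale(new_project)
    # Rank historical projects by Euclidean distance to the new project.
    ranked = sorted(history, key=lambda item: math.dist(scale(item[0]), target))
    nearest = ranked[:k]
    return sum(effort for _, effort in nearest) / len(nearest)


# Hypothetical history: (size in KLOC, team size) -> effort in person-months.
HISTORY = [
    ((10.0, 3.0), 24.0),
    ((12.0, 4.0), 30.0),
    ((50.0, 10.0), 140.0),
    ((55.0, 12.0), 160.0),
]

estimate = estimate_effort_by_analogy(HISTORY, (11.0, 3.0), k=2)
print(estimate)
```

With this toy history, the two small projects are the nearest analogues, so the estimate is the mean of their efforts. A point estimate like this carries no measure of uncertainty, which is the gap the fuzzy variants aim to address.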

Software defect prediction has been an important part of software engineering research since the 1970s. The technique analyzes the measurement and defect information of historical software modules to predict defects in new software modules. Currently, most software defect prediction models are built on data from a single software project: the training data sets used to construct the model and the test data sets used to validate it come from the same project. In practice, however, traditional prediction methods perform poorly for new projects or projects with little historical data. When historical data is insufficient, the defect prediction model cannot be fully trained, and high prediction accuracy is difficult to achieve. In cross-project prediction, the problem to be faced is the difference in data distributions. To address these problems, this paper presents a software defect prediction model based on migration learning (transfer learning) and a traditional software defect prediction model. The model uses existing project data sets to predict software defects across projects. The main work of this article includes:
1) Data preprocessing. This step includes feature correlation analysis, noise reduction, and so on, which limits the effect of over-fitting and noisy data on the prediction results.
2) Migration learning. This step analyzes two different but related project data sets and reduces the impact of differences in data distribution.
3) Artificial neural networks. To address the class imbalance in the data sets, an artificial neural network with dynamic selection of training samples reduces the influence of the imbalance between positive and negative samples on the prediction results.
The Relink and AEEEM data sets are used to evaluate performance in terms of the F-measure, the ROC curve, and AUC. Experiments show that the model has high predictive performance.
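The two evaluation measures named in the abstract, the F-measure and AUC, can be computed as in the minimal sketch below. It assumes binary labels (1 = defective, 0 = clean) and a model that outputs a defect score per module. The labels, scores, and the 0.5 threshold are hypothetical and are not taken from the Relink or AEEEM data sets.

```python
def f_measure(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive (defective) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def auc(y_true, scores):
    """Area under the ROC curve, computed as the probability that a randomly
    chosen defective module is scored above a randomly chosen clean one
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical labels and scores for eight modules (class-imbalanced: 3 vs 5).
Y_TRUE = [1, 0, 1, 0, 0, 1, 0, 0]
SCORES = [0.9, 0.8, 0.7, 0.4, 0.3, 0.6, 0.2, 0.1]
Y_PRED = [1 if s >= 0.5 else 0 for s in SCORES]  # assumed 0.5 decision threshold

f_value = f_measure(Y_TRUE, Y_PRED)
auc_value = auc(Y_TRUE, SCORES)
print(f_value, auc_value)
```

The F-measure depends on the chosen threshold, while AUC is threshold-free, which is one reason both are commonly reported together on class-imbalanced defect data.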

Articles published on Software Project Dataset

Predicting the Number of Software Faults using Deep Learning

An efficient heuristic algorithm for software module clustering optimization

Hybrid PSO-SA approach for feature weighting in analogy-based software project effort estimation

Effort prediction for the software project construction phase

Software defect prediction with imbalanced distribution by radius‐synthetic minority over‐sampling technique

Automatic team recommendation for collaborative software development

Optimized COCOMO parameters using hybrid particle swarm optimization

Particle Swarm Optimization for Predicting the Development Effort of Software Projects

Empirical Evaluation of Mimic Software Project Data Sets for Software Effort Estimation

Ensemble learning for software fault prediction problem with imbalanced data

Uncertainty management in software effort estimation using a consistent fuzzy analogy-based method

Research and Application of Software Defect Prediction based on BP-Migration learning

Software Cost Estimation Using Environmental Adaptation Method

Case-based reasoning with optimized weight derived by particle swarm optimization for software effort estimation

A multi-release software reliability modeling for open source software incorporating dependent fault detection process

Cross-validation based K nearest neighbor imputation for software quality datasets: An empirical study

An Investigation into the Suitability of k-Nearest Neighbour (k-NN) for Software Effort Estimation

The ISBSG Software Project Repository: An Analysis from Six Sigma Measurement Perspective for Software Defect Estimation

An empirical study of some software fault prediction techniques for the number of faults prediction

A Novel Technique of Optimization for the COCOMO II Model Parameters using Teaching-Learning-Based Optimization Algorithm
