Abstract

Software Effort Estimation (SEE) models can be used for decision support by software managers to determine the effort required to develop a software project. They are created based on data describing projects completed in the past. Such data could include past projects from within the company that we are interested in (within-company, i.e., WC projects) and/or from other companies (cross-company, i.e., CC projects). In particular, the use of CC data has been investigated in an attempt to overcome limitations caused by the typically small size of WC datasets. However, software companies operate in non-stationary environments, where changes may affect the typical effort required to develop software projects. Our previous work showed that both WC and CC models of the past can become more or less useful over time, i.e., they can sometimes be helpful and sometimes misleading. So, how can we know if and when a model created based on past data represents well the current projects being estimated? We propose an approach called Dynamic Cross-company Learning (DCL) to dynamically identify which past WC or CC models are most useful for making predictions for a given company at the present time. DCL automatically emphasizes the predictions given by these models in order to improve predictive performance. Our experiments comparing DCL against existing WC and CC approaches show that DCL is successful in improving SEE by emphasizing the most useful past models. A thorough analysis of DCL's behaviour is provided, strengthening its external validity.
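To make the idea of dynamically emphasizing past models concrete, the sketch below shows one simple way such an approach could look in code. It is only an illustration: the class name, the multiplicative weight update and the penalty factor beta are assumptions made here for clarity, not the exact procedure defined by DCL.

    import numpy as np

    class DynamicCrossCompanyLearner:
        """Minimal sketch: weight past WC/CC effort models by recent accuracy.

        The multiplicative weight update below is an illustrative assumption,
        not the exact DCL update rule from the paper.
        """

        def __init__(self, past_models, beta=0.5):
            self.models = past_models  # effort models trained on past WC/CC data
            self.weights = np.ones(len(past_models)) / len(past_models)
            self.beta = beta           # penalty applied to less accurate models

        def predict(self, project_features):
            # Combine the past models' estimates, emphasizing the currently
            # most useful ones through their weights.
            estimates = np.array([m.predict(project_features) for m in self.models])
            return float(np.dot(self.weights, estimates))

        def update(self, project_features, true_effort):
            # Once the true effort of a completed project is known, demote the
            # models that estimated it poorly and renormalize the weights.
            errors = np.array([abs(m.predict(project_features) - true_effort)
                               for m in self.models])
            self.weights[errors > errors.min()] *= self.beta
            self.weights /= self.weights.sum()

In this toy setting, each past model is assumed to expose a scalar-returning predict method; a new project's effort is estimated with predict, and update is called whenever a project completes and its actual effort becomes known, so that the weights track which past models currently fit the company best.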

Highlights

  • Software effort estimation (SEE) is the process of estimating the effort required to develop a software project

  • Our experiments showed that Dynamic Cross-company Learning (DCL) always performed statistically significantly better than random guessing, with a very high effect size

  • They confirm that DCL, which uses both dynamic weighting and filtering, performed similarly to DCL-W (dynamic weighting without filtering), whereas DCL-F (filtering only) and DCL-N (neither dynamic weighting nor filtering) performed worse. These results show that dynamic weighting was essential to DCL's predictive performance, as the best results for each particular dataset were always achieved when dynamic weighting was used (DCL-W or DCL)

Introduction

Software effort estimation (SEE) is the process of estimating the effort required to develop a software project. Software effort is typically the main cost driver in software projects (Jørgensen and Shepperd 2007; Stutzke 2006). Both over- and underestimations of effort can cause problems for a company. Human-made effort estimations may be strongly affected by effort-irrelevant and misleading information, such as the font or margin size of specifications (Jørgensen and Grimstad 2011). Software engineers may not improve their effort estimations even after feedback about their estimates is provided (Gruschke and Jørgensen 2008). SEE models created using machine learning (ML) might not capture some human factors that influence the effort required to develop software projects. We therefore believe that SEE models should be used as decision-support tools to help experts to perform or re-think their estimations.
