Replicating studies on cross- vs single-company effort models using the ISBSG Database

Emilia Mendes, Chris Lokan

doi:10.1007/s10664-007-9045-5

Abstract

In 2001 the ISBSG database was used by Jeffery et al. (Using public domain metrics to estimate software development effort. Proceedings Metrics'01, London, pp 16–27, 2001; S1) to compare the effort prediction accuracy of cross- and single-company effort models. Given that more than 2,000 projects were later volunteered to this database, in 2005 Mendes et al. (A replicated comparison of cross-company and within-company effort estimation models using the ISBSG Database, in Proceedings of Metrics'05, Como, 2005; S2) replicated S1 but obtained different results. The differences in results could have arisen from legitimate differences in data set patterns; however, they could also have arisen from differences in experimental procedure, given that S2 was unable to employ exactly the same experimental procedure used in S1 because S1's procedure was not fully documented. Recently, we applied S2's experimental procedure to the ISBSG database version used in S1 (release 6) to assess whether differences in experimental procedure contributed to the different results (Lokan and Mendes, Cross-company and single-company effort models using the ISBSG Database: a further replicated study, Proceedings of the ISESE'06, pp 75–84, 2006; S3). Our results corroborated those from S1, suggesting that the differences in the results obtained by S2 were likely caused by legitimate differences in data set patterns. We have since been able to reconstruct the experimental procedure of S1, and therefore in this paper we present both S3 and another study (S4), which applied the experimental procedure of S1 to the data set used in S2. By applying the experimental procedure of S2 to the data set used in S1 (study S3), and the experimental procedure of S1 to the data set used in S2 (study S4), we investigate the effect of all the variations between S1 and S2.
Our results for S4 support those of S3, suggesting that differences in data preparation and analysis procedures did not affect the outcome of the analysis. Thus, the different results of S1 and S2 are very likely due to fundamental differences in the data sets.

Journal: Empirical Software Engineering
Publication Date: Aug 8, 2007
Citations: 56

Similar Papers

Cross-company and single-company effort models using the ISBSG database
Chris Lokan ... Emilia Mendes
-
Chris Lokan, et. al.Chris Lokan ... Emilia Mendes
21 Sep 2006
21 Sep 2006

Enthalpy relaxation in polyvinyl acetate
John M Hutchinson ... P Kumar
Thermochimica Acta | VOL. 391
John M Hutchinson, et. al.John M Hutchinson ... P Kumar
17 May 2002
Thermochimica Acta | VOL. 391

Comparative phosphoproteomics reveals evolutionary and functional conservation of phosphorylation across eukaryotes
Jos Boekhorst ... Berend Snel
Genome Biology | VOL. 9
Jos Boekhorst, et. al.Jos Boekhorst ... Berend Snel
01 Jan 2008
Genome Biology | VOL. 9

Pre‐resonance Raman excitation profile of the 3400 cm−1 mode of liquid water
S R Ahmad ... A Iles
Journal of Raman Spectroscopy | VOL. 32
S R Ahmad, et. al.S R Ahmad ... A Iles
01 Aug 2001
Pre‐resonance Raman excitation profile of the 3400 cm−1 mode of liquid water
S R Ahmad ... A Iles

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Replicating studies on cross- vs single-company effort models using the ISBSG Database

Abstract

Talk to us

Similar Papers

More From: Empirical Software Engineering