A COMPARATIVE STUDY OF ABSENT FEATURES AND UNOBSERVED VALUES IN SOFTWARE EFFORT DATA

Wen Zhang, Qing Wang, Ye Yang

doi:10.1142/s0218194012400025

Abstract

Software effort data contain a large number of missing values for project attributes. The problem of absent features, which emerged recently in machine learning, is often neglected by software engineering researchers when handling missingness in software effort data. In essence, absent features (structural missingness) and unobserved values (unstructured missingness) are different cases of missingness, although they look the same in a data set. This paper attempts to clarify the root cause of missingness in software effort data. When regarding missingness as absent features, we develop Max-margin regression to predict the real effort of software projects. When regarding missingness as unobserved values, we use existing imputation techniques to impute the missing values and then use ε-SVR to predict the real effort of software projects from the imputed data sets. Experiments on the ISBSG (International Software Benchmarking Standard Group) and CSBSG (Chinese Software Benchmarking Standard Group) data sets demonstrate that, for the task of effort prediction, treating missingness in software effort data as unobserved values produces better performance than treating it as absent features. This paper is the first to introduce the concept of absent features to deal with missingness in software effort data.

Journal: International Journal of Software Engineering and Knowledge Engineering
Publication Date: Mar 1, 2012
Citations: 9

Similar Papers

The usage of ISBSG data fields in software effort estimation: A systematic mapping study
Fernando González-Ladrón-De-Guevara ... Chris Lokan
Journal of Systems and Software | VOL. 113
02 Dec 2015

Potential and limitations of the ISBSG dataset in enhancing software engineering research: A mapping review
Marta Fernández-Diego ... Fernando González-Ladrón-De-Guevara
Information and Software Technology | VOL. 56
17 Jan 2014

The ISBSG Software Project Repository: An Analysis from Six Sigma Measurement Perspective for Software Defect Estimation
Mhammed Almakadmeh ... Alain Abran
Journal of Software Engineering and Applications | VOL. 10
01 Jan 2017

Comparison of estimation methods of cost and duration in IT projects
Stanislav Berlin ... Moshe Zviran
Information and Software Technology | VOL. 51
04 Nov 2008
