Antecedents of open source software defects: A data mining approach to model formulation, validation and testing

Uzma Raja,Marietta J. Tretter

doi:10.1007/s10799-009-0062-5

Abstract

This paper develops tests and validates a model for the antecedents of open source software (OSS) defects, using Data and Text Mining. The public archives of OSS projects are used to access historical data on over 5,000 active and mature OSS projects. Using domain knowledge and exploratory analysis, a wide range of variables is identified from the process, product, resource, and end-user characteristics of a project to ensure that the model is robust and considers all aspects of the system. Multiple Data Mining techniques are used to refine the model and data is enriched by the use of Text Mining for knowledge discovery from qualitative information. The study demonstrates the suitability of Data Mining and Text Mining for model building. Results indicate that project type, end-user activity, process quality, team size and project popularity have a significant impact on the defect density of operational OSS projects. Since many organizations, both for profit and not for profit, are beginning to use Open Source Software as an economic alternative to commercial software, these results can be used in the process of deciding what software can be reasonably maintained by an organization.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Antecedents of open source software defects: A data mining approach to model formulation, validation and testing

Abstract

Talk to us

Similar Papers

More From: Information Technology and Management

Lead the way for us

Journal: Information Technology and Management	Publication Date: Nov 24, 2009
Citations: 79

Similar Papers

Special issue devoted to papers presented at the second INFORMS workshop on artificial intelligence and data mining, Seattle, November 3, 2007
Wei Jiang ... Anurag Agarwal
Information Technology and Management | VOL. 10
Wei Jiang, et. al.Wei Jiang ... Anurag Agarwal
25 Feb 2009
Information Technology and Management | VOL. 10

Panel — Teaching students to participate in Open Source Software projects
Heidi J C Ellis ... Clif Kussmaul
-
Heidi J C Ellis, et. al.Heidi J C Ellis ... Clif Kussmaul
01 Oct 2010
01 Oct 2010

Team discussions and dynamics during DevOps tool adoptions in OSS projects
Likang Yin ... Vladimir Filkov
-
Likang Yin, et. al.Likang Yin ... Vladimir Filkov
21 Dec 2020
21 Dec 2020

OSS in Software Engineering Education
Fernanda Gomes Silva ... Paulo Ezequiel D Santos
Journal of Software Engineering Research and Development | VOL. -
Fernanda Gomes Silva, et. al.Fernanda Gomes Silva ... Paulo Ezequiel D Santos
18 Jan 2023
Journal of Software Engineering Research and Development | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Antecedents of open source software defects: A data mining approach to model formulation, validation and testing

Abstract

Talk to us

Similar Papers

More From: Information Technology and Management