Predicting Code Hotspots in Open-Source Software from Object-Oriented Metrics Using Machine Learning

Rod Hilton, Ellen Gethner

doi:10.1142/s0218194018500110

Abstract

Software engineers can measure the quality of their code using a variety of metrics derived directly from analysis of the source code. These internal quality metrics are valuable to engineers, but the organizations funding the software development effort place more value on external quality metrics such as defect rates and the time needed to develop features. Unfortunately, external quality metrics can only be calculated after costly software has been developed and deployed to end users. Here, we present a method for mining data from freely available open-source codebases written in Java to train a Random Forest classifier that predicts, from internal quality metrics, which files are likely to be external quality hotspots, with over 75% accuracy. We also used the trained model to predict hotspots for a Java project whose data was not used to train the classifier and again achieved over 75% accuracy, demonstrating the method’s general applicability to different projects.
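As a rough illustration of the idea summarized above, the following minimal sketch trains a small random forest on per-file metrics and evaluates it on a separate, held-out "project". It is not the authors' pipeline: the three metrics (complexity, coupling, lines of code), the labelling rule, and all data are invented for the example, and the hand-rolled forest (bootstrap samples, random feature subsets, majority vote) merely stands in for whatever Random Forest implementation the paper used.

```python
import random

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def best_split(rows, labels, features):
    """Return (score, feature, threshold) with the lowest weighted Gini, or None."""
    best = None
    for f in features:
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def build_tree(rows, labels, rng, depth=0, max_depth=4):
    """Grow one tree, trying a random subset of features at each node."""
    majority = int(sum(labels) * 2 >= len(labels))
    if depth == max_depth or len(set(labels)) == 1:
        return majority
    n_features = len(rows[0])
    k = max(1, int(n_features ** 0.5))
    split = best_split(rows, labels, rng.sample(range(n_features), k))
    if split is None:
        return majority
    _, f, t = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    return (f, t,
            build_tree([r for r, _ in left], [y for _, y in left], rng, depth + 1, max_depth),
            build_tree([r for r, _ in right], [y for _, y in right], rng, depth + 1, max_depth))

def predict_tree(tree, row):
    """Walk from the root to a leaf; leaves are plain 0/1 labels."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

def train_forest(rows, labels, n_trees=15, seed=0):
    """Train each tree on a bootstrap sample of the training files."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]
        forest.append(build_tree([rows[i] for i in idx], [labels[i] for i in idx], rng))
    return forest

def predict_forest(forest, row):
    """Majority vote over all trees: 1 = predicted hotspot, 0 = not."""
    votes = sum(predict_tree(tree, row) for tree in forest)
    return int(votes * 2 > len(forest))

def accuracy(forest, rows, labels):
    hits = sum(predict_forest(forest, r) == y for r, y in zip(rows, labels))
    return hits / len(labels)

def synthetic_project(n_files, seed):
    """Hypothetical per-file metrics and hotspot labels (invented, not from the paper)."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n_files):
        complexity = rng.uniform(1, 50)
        coupling = rng.uniform(0, 20)
        loc = rng.uniform(10, 1000)
        # Invented noisy labelling rule, used only to give the forest a signal.
        risk = complexity / 50 + coupling / 20 + rng.gauss(0, 0.2)
        rows.append((complexity, coupling, loc))
        labels.append(int(risk > 1.0))
    return rows, labels

# Train on one synthetic "project" and evaluate on a different one.
train_rows, train_labels = synthetic_project(200, seed=1)
test_rows, test_labels = synthetic_project(100, seed=2)
forest = train_forest(train_rows, train_labels)
print(f"held-out accuracy: {accuracy(forest, test_rows, test_labels):.2f}")
```

Evaluating on files from a project that contributed no training data mirrors the cross-project check described in the abstract. The accuracy printed here reflects only the synthetic data and says nothing about the paper's reported results.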

Journal: International Journal of Software Engineering and Knowledge Engineering
Publication Date: Mar 1, 2018
Citations: 1
