Abstract
The source codes of a software system are one of the most valuable resources in software development. A developer spends more time and money on development if they start coding from fresh for each similar functionality project. Topic modelling is crucial for software reuse since it can be difficult for developers to remove obsolete source code from large software systems with a lot of code. Topic modelling techniques has been used to evaluate and model source codes in several ways. Several studies on this domain have been used to extract topics from source codes using various statistical approaches and methodologies. These topic extraction methods are interconnected, and if software best practices aren't followed in older systems, it may ensue unreliability of the outcome. In response to these findings, the author conducted a study in which he extracted source code using a java parser library and predicted the source code functionality name using an unsupervised learning approach such as K-mean algorithm and Expectation Maximization (EM) clustering approach. This is the first effort to develop a clustering model to predict the semantic function name of source code using an unsupervised learning approach and compare the algorithms to get the optimal model.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have