Abstract

Key classes have become excellent starting points for developers to understand unknown software systems. Up to now, a variety of approaches have been proposed to mine key classes in a software project. Many of them are based on a network representation (namely, software networks) of the software projects. However, the software networks they used are usually un-weighted and un-directed, which is not consistent with the reality in a real software project where the coupling actually has direction and strength. Worse still, the number of key class candidates returned by existing approaches is usually very large. Thus, it is usually infeasible for developers to start the comprehension process from these classes, especially when there are tight time and resource constraints. To tackle these problems, in this paper, we propose an approach named MinClass, to Mine key Classes in Java projects by examining a very small number of classes. First, the software structure at the class level is represented by a weighted directed software network, which considers both the coupling strength and direction between every pair of classes. Second, we propose a new metric, OSE (One-order Structural Entropy), and use it to calculate the importance of each class in the system. Finally, we sort classes in descending order according to their OSE values, and a small number of top-ranked classes are treated as the key class candidates identified by MinClass. Experiments are performed on six open-source Java projects, and comparison studies with other eight state-of-the-art approaches are also performed. Results show that, although no one method performs best in all software systems, MinClass is the most promising one. It performs best in the whole set of software systems according to the average ranking of the Friedman test. Thus, MinClass is a valuable technique that can be used to mine the key classes.

Highlights

  • Software is always born to solve real problems, but real problems and requirements are always changing [1]

  • We sort classes in descending order according to their OSE values, and a small number of top-ranked classes are treated as the key class candidates identified by MinClass

  • We propose a new problem on key class identification, i.e., identifying key classes by only examining a very small of classes in a specific software project

Read more

Summary

INTRODUCTION

Software is always born to solve real problems, but real problems and requirements are always changing [1]. The aim of this paper is to apply the key node identification technology in complex networks to software networks, so as to provide developers with an ordered list of a small number of key classes (we use term classes to denote both classes and interfaces from here on) for an unknown software project. To this aim, we propose an approach, named MinClass, to Mine key Classes in Java projects by examining a very small number of classes. The rest of this paper is structured as follows: Section II briefly summarizes the related work on identifying key classes in software projects, and Section III introduces our software network and OSE metric in detail.

RELATED WORK
EMPIRICAL STUDY
CONCLUSIONS AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call