Abstract
Code clone genealogy detection has been widely used in refactoring and maintenance of code in software systems. It identifies history of code clones in a source code repository. Existing code clone genealogy detection techniques are based on text similarity for extraction of different versions of program. However, existing proposals in literature for code clone genealogy detection fail to extract code clone genealogy on Git (a distributed version control system). To address this issue, we presented a new scheme, based on transitive closure computation that detects code clone genealogies in e-health system on Git. Our insight is to extract clone pairs from thousands of commits on a software system in Git by transitive closure computation. We implemented the proposed scheme using Hadoop and Neo4j on Open Medical Record System. The results prove efficacy of the proposed scheme using parameters such as transitive depth, ratio of similarity, and count of clones.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have