Mining Common Syntactic Patterns used by Java Programmers

Alvaro Losada, Francisco Ortin, Guillermo Facundo, Miguel Garcia

doi:10.1109/tla.2022.9693559

Abstract

Open source code repositories provide massive amounts of data in the form of programs, which have been used to develop different tools. This kind of work belongs to the active research fields of Big Code and Mining Software Repositories. Although various machine learning approaches already classify the syntactic constructs used by programmers, there are no reports on the most common syntactic patterns used by Java programmers. In this article, we describe a system we built to provide such a report. Our system retrieves the syntactic patterns used by Java programmers, distinguishing those used by experts from those used by beginners. We also present the anomalies found in the usage of different syntactic constructs. We modify the OpenJDK compiler to improve the syntactic information included in its Abstract Syntax Tree (AST), define a mechanism to translate ASTs into n-dimensional vectors, combine the information of different syntactic constructs to build heterogeneous patterns, and apply the Frequent Pattern Growth algorithm to mine the syntactic patterns as association rules. The mined patterns can express hierarchical subpatterns connected to one another, providing a high level of expressiveness.
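The pipeline in the abstract (AST nodes turned into feature vectors, then mined as association rules) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the class name `SyntacticPatternSketch`, the `encode` features (node kind, parent kind, a coarse depth bucket), and the thresholds are hypothetical and not taken from the article. The sketch also swaps in plain pairwise co-occurrence counting instead of Frequent Pattern Growth; for single-item rules both yield the same rules, but the counting approach does not scale to large itemsets.

```java
import java.util.*;

class SyntacticPatternSketch {

    /** Association rule "antecedent -> consequent" with its support and confidence. */
    static final class Rule {
        final String antecedent, consequent;
        final double support, confidence;

        Rule(String antecedent, String consequent, double support, double confidence) {
            this.antecedent = antecedent;
            this.consequent = consequent;
            this.support = support;
            this.confidence = confidence;
        }
    }

    /** Flattens one AST node into "feature=value" items (hypothetical encoding). */
    static Set<String> encode(String nodeKind, String parentKind, int depth) {
        Set<String> items = new TreeSet<>();
        items.add("node=" + nodeKind);
        items.add("parent=" + parentKind);
        items.add("depth=" + (depth <= 2 ? "shallow" : "deep"));
        return items;
    }

    /** Mines rules a -> b (one item on each side) that reach both thresholds. */
    static List<Rule> mine(List<Set<String>> transactions,
                           double minSupport, double minConfidence) {
        Map<String, Integer> itemCount = new HashMap<>();
        Map<List<String>, Integer> pairCount = new HashMap<>();
        for (Set<String> t : transactions) {
            for (String a : t) {
                itemCount.merge(a, 1, Integer::sum);
                for (String b : t) {
                    if (!a.equals(b)) {
                        pairCount.merge(Arrays.asList(a, b), 1, Integer::sum);
                    }
                }
            }
        }
        int n = transactions.size();
        List<Rule> rules = new ArrayList<>();
        for (Map.Entry<List<String>, Integer> e : pairCount.entrySet()) {
            String a = e.getKey().get(0);
            String b = e.getKey().get(1);
            // support = P(a and b); confidence = P(b | a)
            double support = (double) e.getValue() / n;
            double confidence = (double) e.getValue() / itemCount.get(a);
            if (support >= minSupport && confidence >= minConfidence) {
                rules.add(new Rule(a, b, support, confidence));
            }
        }
        // Sort for a deterministic order, since HashMap iteration order is unspecified.
        rules.sort(Comparator.comparing((Rule r) -> r.antecedent)
                             .thenComparing(r -> r.consequent));
        return rules;
    }

    public static void main(String[] args) {
        // Toy "transactions": one per AST node, values invented for the example.
        List<Set<String>> transactions = Arrays.asList(
            encode("If", "MethodBody", 2),
            encode("If", "MethodBody", 2),
            encode("If", "ForLoop", 4),
            encode("Lambda", "MethodCall", 5));
        for (Rule r : mine(transactions, 0.5, 0.8)) {
            System.out.printf("%s -> %s (support %.2f, confidence %.2f)%n",
                r.antecedent, r.consequent, r.support, r.confidence);
        }
    }
}
```

With these four toy transactions and thresholds of 0.5 (support) and 0.8 (confidence), four rules survive, among them `depth=shallow -> node=If`. A real miner would use an FP-tree to avoid enumerating candidate pairs and would allow multi-item antecedents.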


Journal: IEEE Latin America Transactions
Publication Date: May 1, 2022
Citations: 1

Similar Papers

Query by example in large-scale code repositories
Vipin Balachandran
01 Sep 2015

Concurrency State Models and Java Programs
Scalable Computing Practice and Experience | VOL. 3
01 Jan 1999

Heterogeneous tree structure classification to label Java programmers according to their expertise level
Francisco Ortin ... Miguel Garcia
Future Generation Computer Systems | VOL. 105
16 Dec 2019

Automated detection of code smells caused by null checking conditions in Java programs
Kriangchai Sirikul ... Chitsutha Soomlek
01 Jul 2016
