Structural Analysis of Source Code Collected from Programming Contests

Bokuk Park,Hwan Gue Cho,Haesung Tak

doi:10.1109/cit.2014.171

Abstract

Programming contests such as the International Olympiad for Informatics (IOI) and the International Collegiate Programming Contest (ICPC) are effective for encouraging young and bright programmers. These contests require contestants to complete a few tasks (between three and nine) related to algorithmic problems within a limited time. For this study, we collected a set of 2,400 programming codes submitted to the KOI (Korea Olympiad for Informatics) in 2011 and 2012 as well as 2,300 programming codes submitted at the preliminary contest session for the ICPC in 2009, 2011, and 2012 at the East-Asia regional contest. Because submitted source codes were evaluated with blind test cases, we can define a criteria to separate the high- and low-scoring students in the order of their respective scores. The main objective of this paper is to reveal the relationship between the task's proposed features, its difficulty, the school grade (elementary, middle-, and high-school), and the score. We do so with the data-mining tool WEKA. The ultimate goal of this study is to predict the score of some particular code with static analysis. We propose a simple and straightforward complexity measure based on the block-tree structure. We considered the high scoring student group as a positive class and the low scoring student group as negative class. The performance of the data mining classifier named Naive Bayes are evaluated based on 10-fold cross validation test. We decided that the meaningful classification for a harmonic mean of sensitivity and specificity is empirically larger than 0.6 empirically. Among the codes acquired through the KOI, we found a set of outlier codes that attempt to reply with the correct response to receive extra points. Among the codes acquired through the ICPC, we discovered that good collegiate programmers (i.e., Those with high score) attempt to keep their code more compact, both lexically and structurally. We used WEKA to analyze the code using code-features proposed in this study, and the results are detailed quantitatively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Structural Analysis of Source Code Collected from Programming Contests

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Programming Contest Strategy Guide
Aaron Bloomfield ... Borja Sotomayor
-
Aaron Bloomfield, et. al.Aaron Bloomfield ... Borja Sotomayor
17 Feb 2016
17 Feb 2016

Increasing Programming Contest Participation for Fun and Profit (Abstract Only)
Aaron Bloomfield ... Borja Sotomayor
-
Aaron Bloomfield, et. al.Aaron Bloomfield ... Borja Sotomayor
17 Feb 2016
17 Feb 2016

Programming contest strategy
Andrew Trotman ... Chris Handley
Computers & Education | VOL. 50
Andrew Trotman, et. al.Andrew Trotman ... Chris Handley
19 Oct 2006
Computers & Education | VOL. 50

Students motivation for adopting programming contests: Innovation-diffusion perspective
Raghu Raman ... Krishnashree Achuthan
Education and Information Technologies | VOL. 23
Raghu Raman, et. al.Raghu Raman ... Krishnashree Achuthan
10 Apr 2018
Education and Information Technologies | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Structural Analysis of Source Code Collected from Programming Contests

Abstract

Talk to us

Similar Papers