Application of Seq2Seq Models on Code Correction.

Shan Huang,Xiao Zhou,Sang Chin

doi:10.3389/frai.2021.590215

Abstract

We apply various seq2seq models on programming language correction tasks on Juliet Test Suite for C/C++ and Java of Software Assurance Reference Datasets and achieve 75% (for C/C++) and 56% (for Java) repair rates on these tasks. We introduce pyramid encoder in these seq2seq models, which significantly increases the computational efficiency and memory efficiency, while achieving similar repair rate to their nonpyramid counterparts. We successfully carry out error type classification task on ITC benchmark examples (with only 685 code instances) using transfer learning with models pretrained on Juliet Test Suite, pointing out a novel way of processing small programming language datasets.

Highlights

Programming language correction (PLC), which can provide suggestions for people to debug code, identify potential flaws in a program, and help programmers to improve their coding skills, has been an important topic in the Natural Language Processing (NLP) area
We did not finetune these parameters, because (1) we show that the overall performance of seq2seq model on PLC problem is satisfying and (2) we are more concerned about comparison between different attention mechanisms and between pyramid encoder and regular encoder
We show that seq2seq models, successful in natural language correction, are applicable in programming language correction

Summary

Introduction

Programming language correction (PLC), which can provide suggestions for people to debug code, identify potential flaws in a program, and help programmers to improve their coding skills, has been an important topic in the Natural Language Processing (NLP) area. The syntax error problem is relatively well studied; most compilers are able to catch syntax errors, and correcting syntax errors manually is not difficult even for beginner programmers. The latter problem, is much more challenging due to several reasons. Recognizing and correcting these bugs requires a higher level of understanding of the code, including identifying the relationship between objects, making connections between blocks, and matching data types. These errors could be seen in even experienced programmers and can be time consuming to correct manually. This study will focus on automatic correction of these logic errors in code body that pass compiling stage

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Artificial Intelligence	Publication Date: Mar 19, 2021
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Application of Seq2Seq Models on Code Correction.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Artificial Intelligence

Lead the way for us

Similar Papers

Transfer Learning: Making Retrosynthetic Predictions Based on a Small Chemical Reaction Dataset Scale to a New Level.
Renren Bai ... Jiamin Ge
Molecules | VOL. 25
Renren Bai, et. al.Renren Bai ... Jiamin Ge
19 May 2020
Molecules | VOL. 25

Cluster-based deep transfer learning with attention mechanism for residential air conditioning systems
Yoondong Sung ... Woohyun Kim
Applied Thermal Engineering | VOL. 231
Yoondong Sung, et. al.Yoondong Sung ... Woohyun Kim
18 Jun 2023
Applied Thermal Engineering | VOL. 231

Video Transcript Extraction and Summarization Using Transfer Learning
Varun Mehta ... Tushar Deshpande
-
Varun Mehta, et. al.Varun Mehta ... Tushar Deshpande
01 Jan 2023
01 Jan 2023

IntJect: Vulnerability Intent Bug Seeding
Benjamin Petit ... Mike Papadakis
-
Benjamin Petit, et. al.Benjamin Petit ... Mike Papadakis
01 Dec 2022
01 Dec 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Application of Seq2Seq Models on Code Correction.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Artificial Intelligence