Boosting source code suggestion with self-supervised Transformer Gated Highway

Yasir Hussain, Zhiqiu Huang, Yu Zhou, Senzhang Wang

doi:10.1016/j.jss.2022.111553

Abstract

Attention-based transformer language models have shown significant performance gains in various natural language tasks. In this work, we explore the impact of transformer language models on the task of source code suggestion. The core intention of this work is to boost modeling performance for source code suggestion and to explore how training procedures and model architectures affect that performance. Additionally, we propose a transformer-based self-supervised learning technique called Transformer Gated Highway that outperforms recurrent and transformer language models of comparable size. The proposed approach combines the Transformer language model with a Gated Highway, introducing a notion of recurrence. We compare the performance of the proposed approach with the transformer-based BERT (CodeTran), RoBERTa (RoBERTaCode), GPT2 (TravTrans), and CodeGen models and with the recurrent LSTM-based language model (CodeLSTM). Moreover, we experiment with various architectural settings for the transformer models to evaluate their impact on modeling performance. The extensive evaluation shows that the presented approach performs better on datasets for two programming languages: Java and C#. Finally, we adapt the presented approach to the syntax error correction task, where it predicts the correct syntax token, to illustrate its potential for other source code modeling tasks.
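
The abstract does not give the gating equations, so the following is only a minimal sketch of what a highway-style gate around a sublayer could look like. It assumes the standard highway form, t = sigmoid(W x + b) and y = t * h + (1 - t) * x, where x is the sublayer input and h is the sublayer output (for example, the output of a transformer block). The function and parameter names (`highway_gate`, `gate_weights`, `gate_bias`) are hypothetical and are not taken from the paper.

```python
import math


def sigmoid(z):
    """Logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def highway_gate(x, h, gate_weights, gate_bias):
    """Blend a sublayer output h with its input x through a learned gate.

    Assumed (standard highway) form, not confirmed by the source:
        t = sigmoid(W x + b)
        y = t * h + (1 - t) * x
    x, h and gate_bias are lists of floats of equal length;
    gate_weights is a square list-of-lists matching that length.
    """
    y = []
    for i in range(len(x)):
        # Pre-activation of the gate for output position i.
        z = sum(w * xj for w, xj in zip(gate_weights[i], x)) + gate_bias[i]
        t = sigmoid(z)
        # t near 1 passes the transformed signal h; t near 0 carries x through.
        y.append(t * h[i] + (1.0 - t) * x[i])
    return y


if __name__ == "__main__":
    x = [1.0, 2.0]            # sublayer input
    h = [3.0, 6.0]            # sublayer output (hypothetical values)
    zeros = [[0.0, 0.0], [0.0, 0.0]]
    print(highway_gate(x, h, zeros, [0.0, 0.0]))
```

With zero weights and zero bias the gate sits at 0.5, so the result is the element-wise average of x and h. A strongly negative bias makes the layer behave like an identity connection, while a strongly positive bias passes the sublayer output through almost unchanged.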

Journal: The Journal of Systems & Software
Publication Date: Nov 5, 2022
Citations: 6

Similar Papers

Morphology aware data augmentation with neural language models for online hybrid ASR
Balázs Tarján ... Tibor Fegyó
Acta Linguistica Academica | VOL. 69
12 Dec 2022

The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugment
Wei Zhou ... Wilfried Michel
01 May 2020

Ransomware Detection by Distinguishing API Call Sequences through LSTM and BERT Models
Tu-Liang Lin ... Wha-Lee Tseng
The Computer Journal | VOL. 67
02 Mar 2023

MKPM: Multi keyword-pair matching for natural language sentences
Xin Lu ... Yi Gao
Applied Intelligence | VOL. 52
31 May 2021
