Learning Program Representations with a Tree-Structured Transformer

Wenhan Wang,Kechi Zhang,Yang Liu,Zhi Jin,Shangqing Liu,Ge Li,Anran Li

doi:10.1109/saner56733.2023.00032

Abstract

Learning vector representations for programs is a critical step in applying deep learning techniques for program understanding tasks. Various neural network models are proposed to learn from tree-structured program representations, e.g., abstract syntax tree (AST) and concrete syntax tree (CST). However, most neural architectures either fail to capture long-range dependencies which are ubiquitous in programs, or cannot learn effective representations for syntax tree nodes, making them incapable of performing the node-level prediction tasks, e.g., bug localization. In this paper, we propose Tree-Transformer, a novel recursive tree-structured neural network to learn the vector representations for source codes. We propose a multi-head attention mechanism to model the dependency between siblings and parent-children node pairs. Moreover, we propose a bi-directional propagation strategy to allow node information passing in two directions, bottom-up and top-down along trees. In this way, Tree-Transformer can learn the information of the node features as well as the global contextual information. The extensive experimental results show that our Tree-Transformer significantly outperforms the existing tree-based and graph-based program representation learning approaches in both the tree-level and node-level prediction tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Program Representations with a Tree-Structured Transformer

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Influence of Contrastive Learning on Source Code Plagiarism Detection through Recursive Neural Networks
Manuel A Fokam ... Ritesh Ajoodha
-
Manuel A Fokam, et. al.Manuel A Fokam ... Ritesh Ajoodha
23 Nov 2021
23 Nov 2021

Deep Learning With Customized Abstract Syntax Tree for Bug Localization
Hongliang Liang ... Lu Sun
IEEE Access | VOL. 7
Hongliang Liang, et. al.Hongliang Liang ... Lu Sun
01 Jan 2019
IEEE Access | VOL. 7

GC–HGNN: A global-context supported hypergraph neural network for enhancing session-based recommendation
Dunlu Peng ... Shuo Zhang
Electronic Commerce Research and Applications | VOL. 52
Dunlu Peng, et. al.Dunlu Peng ... Shuo Zhang
01 Mar 2022
Electronic Commerce Research and Applications | VOL. 52

Computations over Functional Programs
Dale Miller ... Gopalan Nadathur
-
Dale Miller, et. al.Dale Miller ... Gopalan Nadathur
11 Jun 2012
11 Jun 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Program Representations with a Tree-Structured Transformer

Abstract

Talk to us

Similar Papers