DeGraphCS : Embedding Variable-based Flow Graph for Neural Code Search

Chen Zeng,Shanshan Li,Xin Xia,Yue Yu,Zhiming Wang,Linxiao Bai,Xiangke Liao,Wei Dong,Mingyang Geng

doi:10.1145/3546066

Abstract

With the rapid increase of public code repositories, developers maintain a great desire to retrieve precise code snippets by using natural language. Despite existing deep learning-based approaches that provide end-to-end solutions (i.e., accept natural language as queries and show related code fragments), the performance of code search in the large-scale repositories is still low in accuracy because of the code representation (e.g., AST) and modeling (e.g., directly fusing features in the attention stage). In this paper, we propose a novel learnable de ep G raph for C ode S earch (called deGraphCS ) to transfer source code into variable-based flow graphs based on an intermediate representation technique, which can model code semantics more precisely than directly processing the code as text or using the syntax tree representation. Furthermore, we propose a graph optimization mechanism to refine the code representation and apply an improved gated graph neural network to model variable-based flow graphs. To evaluate the effectiveness of deGraphCS , we collect a large-scale dataset from GitHub containing 41,152 code snippets written in the C language and reproduce several typical deep code search methods for comparison. The experimental results show that deGraphCS can achieve state-of-the-art performance and accurately retrieve code snippets satisfying the needs of the users.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DeGraphCS : Embedding Variable-based Flow Graph for Neural Code Search

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology

Lead the way for us

Journal: ACM Transactions on Software Engineering and Methodology	Publication Date: Mar 30, 2023
Citations: 15

Similar Papers

CSDA: A novel attention-based LSTM approach for code search
Leiming Ren ... Kai Wang
Journal of Physics: Conference Series | VOL. 1544
Leiming Ren, et. al.Leiming Ren ... Kai Wang
01 May 2020
Journal of Physics: Conference Series | VOL. 1544

Relationship-aware code search for JavaScript frameworks
Xuan Li ... Tao Xie
-
Xuan Li, et. al.Xuan Li ... Tao Xie
01 Nov 2016
01 Nov 2016

Query-oriented two-stage attention-based model for code search
Huanhuan Yang ... Luwen Huangfu
The Journal of Systems & Software | VOL. 210
Huanhuan Yang, et. al.Huanhuan Yang ... Luwen Huangfu
03 Jan 2024
The Journal of Systems & Software | VOL. 210

Incorporating Code Structure and Quality in Deep Code Search
Hao Yu ... Yuli Zhao
Applied Sciences | VOL. 12
Hao Yu, et. al.Hao Yu ... Yuli Zhao
16 Feb 2022
Applied Sciences | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DeGraphCS : Embedding Variable-based Flow Graph for Neural Code Search

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology