AdaCCD: Adaptive Semantic Contrasts Discovery Based Cross Lingual Adaptation for Code Clone Detection

Yangkai Du, Tengfei Ma, Xuhong Zhang, Lingfei Wu, Shouling Ji

doi:10.1609/aaai.v38i16.29749

Abstract

Code clone detection, which aims to retrieve functionally similar programs from large code bases, has been attracting increasing attention. Modern software often involves a diverse range of programming languages. However, current code clone detection methods are generally limited to a few popular programming languages, both because annotated data is insufficient and because of constraints in their model design. To address these issues, we present AdaCCD, a novel cross-lingual adaptation method that can detect cloned code in a new language without annotations in that language. AdaCCD leverages language-agnostic code representations from pre-trained programming language models and proposes an Adaptively Refined Contrastive Learning framework to transfer knowledge from resource-rich languages to resource-poor languages. We evaluate the cross-lingual adaptation results of AdaCCD by constructing a multilingual code clone detection benchmark consisting of 5 programming languages. AdaCCD achieves significant improvements over other baselines and achieves performance comparable to supervised fine-tuning.
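
As a rough illustration of the contrastive learning idea mentioned in the abstract, the sketch below computes a generic InfoNCE-style loss over code embeddings. It is not the authors' Adaptively Refined Contrastive Learning framework: the adaptive refinement step is not modeled, the function names `cosine` and `info_nce` and the `temperature` value are hypothetical, and the toy vectors stand in for embeddings that would come from a pre-trained programming language model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss (assumed form, not taken from the paper).

    Returns the negative log-probability of picking the positive
    among the positive plus all negatives, using cosine similarity
    scaled by a temperature as the score.
    """
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, n) / temperature for n in negatives]
    # Numerically stable log-sum-exp over all candidate scores.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum - logits[0]


# Toy embeddings: the positive is a near-duplicate of the anchor,
# the negatives point in unrelated directions.
anchor = [1.0, 0.0]
positive = [0.9, 0.1]
negatives = [[0.0, 1.0], [-1.0, 0.0]]

loss_good = info_nce(anchor, positive, negatives)
# Swapping in an unrelated program as the "positive" should raise the loss.
loss_bad = info_nce(anchor, negatives[0], [positive, negatives[1]])
```

Under these assumptions, minimizing such a loss pulls embeddings of functionally similar programs together and pushes dissimilar ones apart; how positives and negatives are discovered and refined in an unlabeled target language is the part specific to AdaCCD and is described in the paper itself.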

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Mar 24, 2024
Citations: 1
