CARL: Unsupervised Code-Based Adversarial Attacks for Programming Language Models via Reinforcement Learning

Kaichun Yao,Libo Zhang,Hao Wang,Hengshu Zhu,Chuan Qin,Yanjun Wu

doi:10.1145/3688839

Abstract

Code based adversarial attacks play a crucial role in revealing vulnerabilities of software system. Recently, pre-trained programming language models (PLMs) have demonstrated remarkable success in various significant software engineering tasks, progressively transforming the paradigm of software development. Despite their impressive capabilities, these powerful models are vulnerable to adversarial attacks. Therefore, it is necessary to carefully investigate the robustness and vulnerabilities of the PLMs by means of adversarial attacks. Adversarial attacks entail imperceptible input modifications that cause target models to make incorrect predictions. Existing approaches for attacking PLMs often employ either identifier renaming or the greedy algorithm, which may yield sub-optimal performance or lead to high inference times. In response to these limitations, we propose CARL, an unsupervised black-box attack model that leverages reinforcement learning to generate imperceptible adversarial examples. Specifically, CARL comprises a programming language encoder and a perturbation prediction layer. In order to achieve more effective and efficient attack, we cast the task as a sequence decision-making process, optimizing through policy gradient with a suite of reward functions. We conduct extensive experiments to validate the effectiveness of CARL on code summarization, code translation, and code refinement tasks, covering various programming languages and PLMs. The experimental results demonstrate that CARL surpasses state-of-the-art code attack models, achieving the highest attack success rate across multiple tasks and PLMs while maintaining high attack efficiency, imperceptibility, consistency, and fluency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CARL: Unsupervised Code-Based Adversarial Attacks for Programming Language Models via Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology

Lead the way for us

Similar Papers

Generating watermarked adversarial texts
Mingjie Li ... Hanzhou Wu
Journal of Electronic Imaging | VOL. 32
Mingjie Li, et. al.Mingjie Li ... Hanzhou Wu
28 Mar 2023
Journal of Electronic Imaging | VOL. 32

Rethinking Textual Adversarial Defense for Pre-Trained Language Models
Jiayi Wang ... Rongzhou Bao
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Jiayi Wang, et. al.Jiayi Wang ... Rongzhou Bao
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

CodeAttack: Code-Based Adversarial Attacks for Pre-trained Programming Language Models
Akshita Jha ... Chandan K Reddy
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37
Akshita Jha, et. al.Akshita Jha ... Chandan K Reddy
26 Jun 2023
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37

Adversarial Examples in Deep Neural Networks: An Overview
Emilio Rafael Balda ... Rudolf Mathar
-
Emilio Rafael Balda, et. al.Emilio Rafael Balda ... Rudolf Mathar
24 Oct 2019
24 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CARL: Unsupervised Code-Based Adversarial Attacks for Programming Language Models via Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology