SAViP: Semantic-Aware Vulnerability Prediction for Binary Programs with Neural Networks

Xu Zhou,Xugang Wu,Pengfei Wang,Bingjie Duan

doi:10.3390/app13042271

Abstract

Vulnerability prediction, in which static analysis is leveraged to predict the vulnerabilities of binary programs, has become a popular research topic. Traditional vulnerability prediction methods depend on vulnerability patterns, which must be predefined by security experts in a time-consuming manner. The development of Artificial Intelligence (AI) has yielded new options for vulnerability prediction. Neural networks allow vulnerability patterns to be learned automatically. However, current works extract only one or two types of features and use traditional models such as word2vec, which results in the loss of much instruction-level information. In this paper, we propose a model named SAViP to predict vulnerabilities in binary programs. To fully extract binary information, we integrate three kinds of features: semantic, statistical, and structural features. For semantic features, we apply the Masked Language Model (MLM) pre-training task of the RoBERTa model to the assembly code to build our language model. Using this model, we innovatively combine the beginning token and the operation-code token to create the instruction embedding. For the statistical features, we design a 56-dimensional feature vector that contains 43 kinds of instructions. For the structural features, we improve the ability of the structure2vec network to obtain the characteristic of the network by emphasizing node self-attention. Through these optimizations, we significantly increase the accuracy of vulnerability prediction over existing methods. Our experiments show that SAViP achieves a recall of 77.85% and Top 100∼600 accuracies all above 95%. The results are 10% and 13% higher than those of the state-of-the-art V-Fuzz, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SAViP: Semantic-Aware Vulnerability Prediction for Binary Programs with Neural Networks

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Feb 10, 2023
License type: CC BY 4.0

Similar Papers

ChatGPT Isn't Magic
Tama Leaver ... Suzanne Srdarov
M/C Journal | VOL. 26
Tama Leaver, et. al.Tama Leaver ... Suzanne Srdarov
02 Oct 2023
M/C Journal | VOL. 26

Conditional BERT Contextual Augmentation
Xing Wu ... Shangwen Lv
-
Xing Wu, et. al.Xing Wu ... Shangwen Lv
01 Jan 2019
01 Jan 2019

Cultural Understanding Using In-context Learning and Masked Language Modeling
Ming Qian ... Davis Qian
-
Ming Qian, et. al.Ming Qian ... Davis Qian
01 Jan 2020
01 Jan 2020

The Latest in Natural Language Generation: Trends, Tools and Applications in Industry
Heinrihs Kristians Skrodelis ... Henrihs Gorskis
-
Heinrihs Kristians Skrodelis, et. al.Heinrihs Kristians Skrodelis ... Henrihs Gorskis
27 Apr 2023
27 Apr 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SAViP: Semantic-Aware Vulnerability Prediction for Binary Programs with Neural Networks

Abstract

Talk to us

Similar Papers

More From: Applied Sciences