Abstract

As the risks posed by software vulnerabilities grow rapidly, detecting vulnerabilities in binary code has become an important concern for the software community. However, research on binary-level vulnerability detection has largely been limited to handcrafted features designed by a small group of domain experts. This paper explores further possibilities for detecting vulnerabilities in binary code. Building on recent deep learning research, we study the maximal divergence sequential auto-encoder (MDSAE) model and propose a modified version (MDSAE-NR). We also propose a variant of the time-delay neural network (TDNN-NR) that incorporates a new regularization technique, yielding improved results. Both models achieved strong predictive performance compared to the baselines across evaluation metrics such as accuracy, recall, precision, and F1 score. Based on our experiments, we observed an average improvement of 2 to 2.5% on each performance measure of interest.

Highlights

  • Software becomes vulnerable if it contains flaws that could create a backdoor through which an attacker can gain access to a system and conduct malicious activities

  • We study the maximal divergence sequential auto-encoder (MDSAE) model and propose a modified version that leverages a variational auto-encoder (VAE) and a new regularization technique for binary code vulnerability detection

  • We propose a new model based on a time-delay neural network (TDNN-NR)


Summary

INTRODUCTION

Software becomes vulnerable if it contains flaws that could create a backdoor through which an attacker can gain access to a system and conduct malicious activities. Previous studies have proposed methods for detecting vulnerabilities at the binary code level when access to source code is not granted. Such studies were based on symbolic execution, fuzzing [12]–[14], techniques that utilize handcrafted features derived from dynamic analysis [15]–[17], or function similarity, which helps identify known bugs in binaries [18]. We study the maximal divergence sequential auto-encoder (MDSAE) model and propose a modified version that leverages a variational auto-encoder (VAE) and a new regularization technique for binary code vulnerability detection. The experimental results indicate that the two variants (MDSAE-NR and TDNN-NR) outperform the baselines on all performance measures of interest.
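The TDNN variant mentioned above builds on a time-delay architecture, in which each output frame depends on a fixed temporal context of input frames. A minimal sketch of a single time-delay (1D convolution) layer, assuming NumPy; the function name, shapes, and parameters are illustrative and not taken from the paper:

```python
import numpy as np

def tdnn_layer(x, weights, bias, dilation=1):
    """One time-delay (1D convolution) layer: each output frame is a
    function of a fixed window of input frames.
    x: (T, d_in) sequence; weights: (k, d_in, d_out); bias: (d_out,)."""
    k = weights.shape[0]
    span = (k - 1) * dilation          # temporal extent of the context window
    t_out = x.shape[0] - span          # number of valid output frames
    out = np.empty((t_out, weights.shape[2]))
    for t in range(t_out):
        window = x[t : t + span + 1 : dilation]            # (k, d_in)
        out[t] = np.einsum("ki,kio->o", window, weights) + bias
    return np.maximum(out, 0.0)        # ReLU nonlinearity

rng = np.random.default_rng(0)
x = rng.normal(size=(10, 4))   # 10 input frames, 4 features each
w = rng.normal(size=(3, 4, 8)) # context width 3, 8 output features
b = np.zeros(8)
y = tdnn_layer(x, w, b)
print(y.shape)  # (8, 8): 10 - (3 - 1) valid frames, 8 features
```

Stacking such layers with increasing dilation widens the temporal context seen by deeper layers, which is how TDNNs capture long-range dependencies in sequences.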

BACKGROUND
THE KULLBACK-LEIBLER DIVERGENCE AND L2 WASSERSTEIN DISTANCE
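Both quantities named in this heading admit closed forms when the two distributions are Gaussian, which is the setting a VAE-based model works in. A minimal sketch for the univariate case; the helper names are illustrative, not from the paper:

```python
import math

def kl_gaussian(mu1, sigma1, mu2, sigma2):
    """KL(N(mu1, sigma1^2) || N(mu2, sigma2^2)) for univariate Gaussians."""
    return (math.log(sigma2 / sigma1)
            + (sigma1**2 + (mu1 - mu2)**2) / (2 * sigma2**2)
            - 0.5)

def w2_squared_gaussian(mu1, sigma1, mu2, sigma2):
    """Squared L2 Wasserstein distance between univariate Gaussians."""
    return (mu1 - mu2)**2 + (sigma1 - sigma2)**2

# Identical distributions: both quantities are zero.
print(kl_gaussian(0.0, 1.0, 0.0, 1.0))         # 0.0
print(w2_squared_gaussian(0.0, 1.0, 0.0, 1.0)) # 0.0
```

Note that the KL divergence is asymmetric in its arguments, while the Wasserstein distance is a true metric; this difference matters when either is used as a divergence to maximize between latent distributions of vulnerable and non-vulnerable code.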
Findings
CONCLUSION