Detecting Cryptography Misuses With Machine Learning: Graph Embeddings, Transfer Learning and Data Augmentation in Source Code Related Tasks

Gustavo Eloi De Paula Rodrigues, Alexandre M Braga, Ricardo Dahab

doi:10.1109/tr.2023.3237849

Abstract

Cryptography is a ubiquitous tool in secure software development, used to guarantee security requirements in general. However, software developers have scarce knowledge about cryptography and rely on limited support tools that cannot properly detect bad uses of cryptography, thus generating vulnerabilities in software. In this work, we extend the scarce use of machine learning to detect cryptography misuse in source code by using a state-of-the-art deep learning model (i.e., code2vec) through transfer learning to generate features that feed machine learning models. In addition, we compare this approach to previous ones in different types of binary models. Also, we adapt code obfuscation to serve as data augmentation in machine learning tasks related to source code. Finally, we show that, through transfer learning, code2vec can be a competitive feature generator for cryptography misuse detection, and that simple code obfuscation can be used to generate data that enhances the training of machine learning models in source-code-related tasks.
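
To make the data augmentation idea concrete, the following is a minimal sketch of one simple obfuscation, identifier renaming, applied to a labeled Java snippet. It is an illustration under stated assumptions, not the authors' obfuscator: the helper names `rename_identifiers` and `augment`, the sample snippet, and the label convention (1 = misuse) are all hypothetical. The sketch relies on the observation that renaming local identifiers leaves the cryptographic API calls, and therefore the label, unchanged.

```python
import re

def rename_identifiers(source, names, prefix="v"):
    """Rename the given identifiers in a code string (hypothetical helper).

    Limitation: this is a plain regex substitution, so an identifier that
    also appears inside a string literal or comment would be renamed too.
    """
    mapping = {name: f"{prefix}{i}" for i, name in enumerate(names)}
    if not mapping:
        return source
    # Whole-word, case-sensitive match, so `cipher` does not touch `Cipher`.
    pattern = re.compile(r"\b(" + "|".join(re.escape(n) for n in mapping) + r")\b")
    return pattern.sub(lambda m: mapping[m.group(1)], source)

def augment(sample, label, names, copies=2):
    """Return the original sample plus `copies` renamed variants, same label."""
    out = [(sample, label)]
    for k in range(copies):
        out.append((rename_identifiers(sample, names, prefix=f"a{k}_"), label))
    return out

# Assumed example: a snippet labeled 1 (misuse) because it requests DES.
snippet = 'Cipher cipher = Cipher.getInstance("DES"); cipher.init(mode, key);'
augmented = augment(snippet, 1, ["cipher", "mode", "key"])
for code, label in augmented:
    print(label, code)
```

Each variant keeps the `Cipher.getInstance("DES")` call intact, so a feature generator sees syntactically different programs that share the same misuse label.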

Journal: IEEE Transactions on Reliability
Publication Date: Dec 1, 2023
Citations: 1

Similar Papers

A Systematic Literature Review on Automated Software Vulnerability Detection Using Machine Learning
Nima Shiri Harzevili ... Nachiappan Nagappan
ACM Computing Surveys | VOL. 57
11 Nov 2024

Perception without preconception: comparison between the human and machine learner in recognition of tissues from histological sections
Sanghita Barui ... K S Rajmohan
Scientific Reports | VOL. 12
30 Sep 2022

Physics-Guided Data Augmentation Combined with Unsupervised Learning Improves Stability and Accuracy of Bit Wear Deep Learning Model
Huang Xu ... Guodong David Zhan
27 Feb 2024

Prediction of Aquatic Ecosystem Health Indices through Machine Learning Models Using the WGAN-Based Data Augmentation Method
Seoro Lee ... Joo Hyun Bae
Sustainability | VOL. 13
18 Sep 2021
