Abstract

With the rapid development of deep learning technologies, different deep learning models have been applied to detect and classify malware. When applying deep learning models to classify malware families, a major bottleneck is the lack of enough labeled family samples that are required for training deep learning models. The depth model applied to malware needs a huge number of samples for training. In order to solve this issue, we propose a method for generating malware family samples. We use the Grad-CAM algorithm to find the raw data representing malware features. A new sample is created by inserting them into section gaps and new sections in PE files. The experiment results show that adding the generated samples into training dataset can improve the classification accuracy of deep learning models.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call