File Fragment Type Identification Based on CNN and LSTM

Nan Zhu,Changyou Ma,Kun Wang,Yang Liu

doi:10.1145/3585542.3585545

Abstract

In digital forensics, file carving is the process of recovering files on a storage media without any file system information. Note that when a file is deleted, the file system does not zero-out the corresponding data blocks because their content will be overwritten by other new files later. Due to a deleted file may be divided into different parts or successive but partly occupied by a new file, evidence may be found in deleted file fragments. Therefore, identifying the type of a file fragment is a necessary step for effective file carving. In this paper, we proposed a file fragment type identification network architecture based on CNN (convolutional neural networks) and LSTM (Long Short-Term Memory). Specifically, we first use a trainable embedding layer to convert sparse binary file fragment into compact real-valued representations. Then, successive convolutional modules are utilized to learn higher level representation of file fragments. Finally, the obtained features are fed into LSTM for classification. Our proposed deep network architecture was trained and tested on the largest public file fragment dataset FFT-75. Experimental results show that we can achieve average accuracy of 66.5% and 78.6% for 512-bytes and 4096-bytes file fragments, respectively, which are higher than existing work.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

File Fragment Type Identification Based on CNN and LSTM

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A New Approach to Multimedia Files Carving
Weidong Qiu ... Jie Guo
-
Weidong Qiu, et. al.Weidong Qiu ... Jie Guo
01 Nov 2014
01 Nov 2014

Gujarati Task Oriented Dialogue Slot Tagging Using Deep Neural Network Models
Rachana Parikh ... Hiren Joshi
-
Rachana Parikh, et. al.Rachana Parikh ... Hiren Joshi
01 Jan 2020
01 Jan 2020

Online leakage current classification using convolutional neural network long short-term memory for high voltage insulators on web-based service
Phuong Nguyen Thanh ... Ming-Yuan Cho
Electric Power Systems Research | VOL. 216
Phuong Nguyen Thanh, et. al.Phuong Nguyen Thanh ... Ming-Yuan Cho
01 Mar 2023
Electric Power Systems Research | VOL. 216

Enhancement of Text Recognizing Exploitation in Phishing Websites using LSTM in Comparison with CNN based on Improving the Accuracy Rate
Shaik Yakub Pasha
International Journal For Multidisciplinary Research | VOL. 5
Shaik Yakub Pasha Shaik Yakub Pasha
26 Oct 2023
International Journal For Multidisciplinary Research | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

File Fragment Type Identification Based on CNN and LSTM

Abstract

Talk to us

Similar Papers