Malware Classification by Deep Learning Using Characteristics of Hash Functions

Takahiro Baba, Toshihiro Yamauchi, Kensuke Baba

doi:10.1007/978-3-030-99587-4_40

https://doi.org/10.1007/978-3-030-99587-4_40

Publication Date: Jan 1, 2022

Citations: 1

Affiliation: Okayama University

Abstract

As the Internet develops, the number of Internet of Things (IoT) devices increases. Simultaneously, the risk of IoT devices being infected with malware also increases. Thus, malware detection has become an important issue. Dynamic analysis logs are effective at detecting malware, but it takes time to collect a large amount of data because the malware must be executed at least once before the logs can be collected. Moreover, dynamic analysis logs are affected by external factors such as the execution environment. A malware detection method that uses a static property analysis log could solve these problems. In this study, deep learning (DL) was used as a machine learning method because DL is effective for large-scale data and can automatically extract features.

Research has been conducted on malware detection using static properties of portable executable (PE) files, establishing that such detection is possible. However, research on malware detection using hash functions such as Fuzzy hash and peHash is lacking. Therefore, we investigated the characteristics of hash values in malware classification. Moreover, when the surface analysis log is viewed in chronological order, the data are considered to have concept drift characteristics. Therefore, we compared malware detection performance using data with the concept drift property. We found that the hash function could be used to prevent performance degradation even with concept drift data. In an experiment combining PE surface information and hash values, concept drift showed the highest performance for certain data.

Keywords: Malware detection, Deep learning, PE file, Fuzzy hash, peHash

Full Text