Compact feature hashing for machine learning based malware detection

Damin Moon,Myungkeun Yoon,Jaekoo Lee

doi:10.1016/j.icte.2021.08.005

Compact feature hashing for machine learning based malware detection

Damin Moon, Myungkeun Yoon + Show 1 more

Open Access

https://doi.org/10.1016/j.icte.2021.08.005

Copy DOI

Export

Save

Cite

Journal: ICT Express	Publication Date: Mar 1, 2022
Citations: 6	License type: cc-by-nc-nd

Affiliation: Kookmin University

#Machine Learning Based Malware Detection #Feature Hashing #Vector Size #Reduces Memory Space #Malware Files #Benign Files #Real Malware #Fixed-length Vector #Dataset Of Files #Variant Files

Abstract
Full-Text
Similar Papers

Abstract

Listen

Machine learning can detect variant malware files that can evade signature-based detection. Feature hashing is used to convert features into a fixed-length vector. In this paper, we study the appropriate vector size for feature hashing for a large dataset of malware files. Through exhaustive experiments on more than 280,000 real malware and benign files, we find for the first time that the default vector size of current feature hashing practices is unnecessarily large. We experimentally explore the appropriate vector size, which not only reduces memory space by 70% but also increases the detection accuracy, compared with the state-of-the-art scheme.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: ICT Express

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Compact feature hashing for machine learning based malware detection

Abstract

Published Version

Talk to us

Similar Papers

More From: ICT Express

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Compact feature hashing for machine learning based malware detection

Abstract

Published Version

Talk to us

Similar Papers

More From: ICT Express