Abstract

In recent years, a significant amount of research has focused on analyzing the effectiveness of machine learning (ML) models for malware detection. These approaches range from methods such as decision trees and clustering to more complex ones such as support vector machines (SVMs) and deep neural networks. Neural networks in particular have proven very effective at detecting complex and advanced malware, but this comes with a caveat: neural networks are notoriously complex, and their decisions are often accepted without questioning why the model made a specific prediction. The black-box nature of neural networks has challenged researchers to explore methods for explaining black-box models such as SVMs and neural networks and their decision-making process. Transparency and explainability give experts and malware analysts assurance of, and trust in, the ML models’ decisions. They also help in generating comprehensive reports that can enhance cyber threat intelligence sharing. This much-needed analysis drives our work in this paper, in which we explore the explainability and interpretability of ML models for online malware detection. We use the SHapley Additive exPlanations (SHAP) technique to interpret the output of several ML models, namely linear SVM, SVM-RBF (Radial Basis Function), Random Forest (RF), Feed-Forward Neural Network (FFNN), and Convolutional Neural Network (CNN) models, trained on an online malware dataset. To explain the output of these models, the KernelSHAP, TreeSHAP, and DeepSHAP explainers are applied to the trained models.
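The following is a minimal sketch (not the authors' code) of how the SHAP variants named above are typically applied with the open-source shap library: TreeSHAP for a tree ensemble such as Random Forest and KernelSHAP for a kernel SVM. The feature matrix here is synthetic placeholder data; the paper's online malware dataset is not reproduced.

```python
# Hedged sketch: applying TreeSHAP and KernelSHAP to stand-in models.
# Placeholder data only; the actual online malware features are an assumption here.
import numpy as np
import shap
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))           # placeholder feature matrix
y = (X[:, 0] + X[:, 3] > 0).astype(int)  # placeholder benign/malware labels

# TreeSHAP: exact, efficient Shapley values for tree ensembles such as RF.
rf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
tree_explainer = shap.TreeExplainer(rf)
tree_shap_values = tree_explainer.shap_values(X)

# KernelSHAP: model-agnostic, shown here for an RBF-kernel SVM.
svm = SVC(kernel="rbf", probability=True, random_state=0).fit(X, y)
background = shap.kmeans(X, 10)          # summarize background set for tractability
kernel_explainer = shap.KernelExplainer(svm.predict_proba, background)
kernel_shap_values = kernel_explainer.shap_values(X[:20])  # explain a small sample

# DeepSHAP (shap.DeepExplainer) would be applied analogously to the FFNN/CNN
# models, passing the trained network and a background batch of training examples.
```

Per-feature SHAP values produced this way can then be aggregated (e.g., with shap.summary_plot) to show which features drive each model's malware/benign decisions.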
