Mitigating Webshell Attacks through Machine Learning Techniques

You Guo,Hector Marco-Gisbert,Paul Keir

doi:10.3390/fi12010012

You Guo, Hector Marco-Gisbert + Show 1 more

Open Access

https://doi.org/10.3390/fi12010012

Copy DOI

Abstract

A webshell is a command execution environment in the form of web pages. It is often used by attackers as a backdoor tool for web server operations. Accurately detecting webshells is of great significance to web server protection. Most security products detect webshells based on feature-matching methods—matching input scripts against pre-built malicious code collections. The feature-matching method has a low detection rate for obfuscated webshells. However, with the help of machine learning algorithms, webshells can be detected more efficiently and accurately. In this paper, we propose a new PHP webshell detection model, the NB-Opcode (naïve Bayes and opcode sequence) model, which is a combination of naïve Bayes classifiers and opcode sequences. Through experiments and analysis on a large number of samples, the experimental results show that the proposed method could effectively detect a range of webshells. Compared with the traditional webshell detection methods, this method improves the efficiency and accuracy of webshell detection.

Highlights

With the development of web technology and the explosive growth of information, web security becomes more and more important
Opcode is the intermediate language after PHP script compilation, and its relationship with PHP is analogous to Java virtual machine (JVM) byte-code’s relationship to Java
A webshell detection method based on a naïve Bayes algorithm and opcode sequence is proposed

Summary

Introduction

With the development of web technology and the explosive growth of information, web security becomes more and more important Web vulnerabilities such as SQL injection and XSS attacks [1] are some of the most common security problems. Attackers often exploit vulnerabilities in the system or web applications to upload a malicious file or malicious code to the webserver. Attackers use a range of methods to bypass traditional detection, including malicious function segmentation, Base encoding, and other techniques. These traditional webshell detection methods are ineffective in detecting webshells that have been obfuscated.

Webshell

Simple Webshell

Machine Learning

Unsupervised Learning

Static and Dynamic Detection

Flow Analysis Detection

Log Analysis Detection

Behavior Analysis Detection

Statistical Analysis

Threats

Plain Webshell

Obfuscated Webshell

Split Webshell

Remote Webshell

Proposed Solution

Opcode

Data Preprocessing

Feature Extraction and Representation

Word Bag and TF-IDF Models

Model Training and Validation

Experiments

Effectiveness of the Approach

Conclusions and Future Work

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Future internet	Publication Date: Jan 14, 2020
Citations: 22	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Mitigating Webshell Attacks through Machine Learning Techniques

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Future internet

Lead the way for us

Similar Papers

Detection approach of webshell attacks based on multi-dimensional dynamic features
Zhang Jiawen ... Wang Shaohui
-
Zhang Jiawen, et. al.Zhang Jiawen ... Wang Shaohui
01 Jun 2021
01 Jun 2021

Webshell detection with byte-level features based on deep learning
Xiao Zhongzheng ... Nurbol Luktarhan
Journal of Intelligent & Fuzzy Systems | VOL. 40
Xiao Zhongzheng, et. al.Xiao Zhongzheng ... Nurbol Luktarhan
01 Jan 2020
Journal of Intelligent & Fuzzy Systems | VOL. 40

Research on WebShell Detection Method Based on Regularized Neighborhood Component Analysis (RNCA)
Aijun Zhou ... Nurbol Luktarhan
Symmetry | VOL. 13
Aijun Zhou, et. al.Aijun Zhou ... Nurbol Luktarhan
04 Jul 2021
Symmetry | VOL. 13

Study on Charged Detection Method of Porcelain Insulator in Single Asymmetric Transmission Line
Keqiang Wang ... Xiaoning Tang
-
Keqiang Wang, et. al.Keqiang Wang ... Xiaoning Tang
01 Aug 2019
01 Aug 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mitigating Webshell Attacks through Machine Learning Techniques

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Future internet