Reinforcement Learning for Efficient Network Penetration Testing

Mohamed C Ghanem,Thomas M Chen

doi:10.3390/info11010006

Abstract

Penetration testing (also known as pentesting or PT) is a common practice for actively assessing the defenses of a computer network by planning and executing all possible attacks to discover and exploit existing vulnerabilities. Current penetration testing methods are increasingly becoming non-standard, composite and resource-consuming despite the use of evolving tools. In this paper, we propose and evaluate an AI-based pentesting system which makes use of machine learning techniques, namely reinforcement learning (RL) to learn and reproduce average and complex pentesting activities. The proposed system is named Intelligent Automated Penetration Testing System (IAPTS) consisting of a module that integrates with industrial PT frameworks to enable them to capture information, learn from experience, and reproduce tests in future similar testing cases. IAPTS aims to save human resources while producing much-enhanced results in terms of time consumption, reliability and frequency of testing. IAPTS takes the approach of modeling PT environments and tasks as a partially observed Markov decision process (POMDP) problem which is solved by POMDP-solver. Although the scope of this paper is limited to network infrastructures PT planning and not the entire practice, the obtained results support the hypothesis that RL can enhance PT beyond the capabilities of any human PT expert in terms of time consumed, covered attacking vectors, accuracy and reliability of the outputs. In addition, this work tackles the complex problem of expertise capturing and re-use by allowing the IAPTS learning module to store and re-use PT policies in the same way that a human PT expert would learn but in a more efficient way.

Highlights

Computer networks are more than ever exposed to cyber threats of increasing frequency, complexity and sophistication [1]
We presented a general background of the reinforcement learning (RL) domain and justified the choice of such approach for intelligent automated penetration testing system (IAPTS) along with a brief introduction of the considered candidate algorithms and their advantages and in consideration of the specific context of PT in complex and large RL
In the first phases of this research, we aimed to assess the effectiveness of the proposed partially observed Markov decision process (POMDP) modeling of PT and evaluating our choices in terms of learning approaches, used algorithms, and capturing and managing the expertise as we discussed in detail in [12]

Summary

Introduction

Computer networks are more than ever exposed to cyber threats of increasing frequency, complexity and sophistication [1]. Penetration testing (shortly known as pentesting PT) is a wellestablished proactive method to evaluate the security of digital assets, varying from a single computer to websites and networks, by actively searching for and exploiting the existing vulnerabilities. In addition to legal requirements, PT is considered by the cybersecurity community as the most effective method to assess the strength of security defenses against skilled adversaries as well as the adherence to security policies [2]. PT as illustrated in Figure 1 is a multi-stage process that often requires a high degree of competence and expertise due to the complexity of digital assets such as medium and large networks

Objectives

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information	Publication Date: Dec 20, 2019
Citations: 64	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Reinforcement Learning for Efficient Network Penetration Testing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information

Lead the way for us

Similar Papers

Deep Reinforcement Learning With Modulated Hebbian Plus Q-Network Architecture.
Pawel Ladosz ... Nicholas Ketz
IEEE transactions on neural networks | VOL. 33
Pawel Ladosz, et. al.Pawel Ladosz ... Nicholas Ketz
01 May 2022
IEEE transactions on neural networks | VOL. 33

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems Part 2—Applications in Transportation, Industries, Communications and Networking and More Topics
Xuanchen Xiang ... Huanyu Zang
Machine Learning and Knowledge Extraction | VOL. 3
Xuanchen Xiang, et. al.Xuanchen Xiang ... Huanyu Zang
28 Oct 2021
Machine Learning and Knowledge Extraction | VOL. 3

A special case of partially observable Markov decision processes problem by event-based optimization
Junyu Zhang
-
Junyu ZhangJunyu Zhang
01 Mar 2016
01 Mar 2016

A Bayesian game based adaptive fuzzy controller for multiagent POMDPs
Rajneesh Sharma ... Matthijs T J Spaan
-
Rajneesh Sharma, et. al.Rajneesh Sharma ... Matthijs T J Spaan
01 Jul 2010
01 Jul 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforcement Learning for Efficient Network Penetration Testing

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information