Optimizing Attack Surface and Configuration Diversity Using Multi-objective Reinforcement Learning

Bentz Tozer,Thomas Mazzuchi,Shahram Sarkani

doi:10.1109/icmla.2015.144

Abstract

Minimizing the attack surface of a system and introducing diversity into a system are two effective ways to improve system security. However, determining how to include diversity in a system without increasing the attack surface more than necessary is a difficult problem, requiring knowledge about the system characteristics, operating environment, and available permutations that is generally not available prior to system deployment. We propose viewing a system's components, interfaces, and communication channels as a set of states and actions that can be analyzed using a sequential decision making process, and using a multi-objective reinforcement learning algorithm to learn a set of policies that minimize a system's attack surface and execute those policies to obtain configuration diversity while a system is operating. We describe a methodology for designing a system such that its components and behaviors can be translated into a multi-objective Markov Decision Process, demonstrate the use of multi-objective reinforcement learning to learn a set of optimal policies using three different multi-objective reinforcement learning algorithms in the context of an online file sharing application, and show that our multi-objective temporal difference afterstate algorithm outperforms the alternatives for the example problem.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimizing Attack Surface and Configuration Diversity Using Multi-objective Reinforcement Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Multi-objective Reinforcement Learning for Energy Harvesting Wireless Sensor Nodes
Shaswot Shresthamali ... Hiroshi Nakamura
-
Shaswot Shresthamali, et. al.Shaswot Shresthamali ... Hiroshi Nakamura
01 Dec 2021
01 Dec 2021

Policy gradient approaches for multi-objective sequential decision making
Simone Parisi ... Nicola Smacchia
-
Simone Parisi, et. al.Simone Parisi ... Nicola Smacchia
01 Jul 2014
01 Jul 2014

A robust policy bootstrapping algorithm for multi-objective reinforcement learning in non-stationary environments
Sherif Abdelfattah ... Kathryn Kasmarik
Adaptive Behavior | VOL. 28
Sherif Abdelfattah, et. al.Sherif Abdelfattah ... Kathryn Kasmarik
15 Aug 2019
Adaptive Behavior | VOL. 28

An Approach to Measuring a System's Attack Surface
Pratyusa K Manadhata ... Jeannette M Wing
-
Pratyusa K Manadhata, et. al.Pratyusa K Manadhata ... Jeannette M Wing
01 Aug 2007
01 Aug 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimizing Attack Surface and Configuration Diversity Using Multi-objective Reinforcement Learning

Abstract

Talk to us

Similar Papers