Extensible Machine Learning for Encrypted Network Traffic Application Labeling via Uncertainty Quantification

Steven Jorgensen, Allan Wollaber, John Holodnak, Jensen Dempsey, Vernon Rivet, Andrés Alejos, Noah Demoes, Karla De Souza, Ananditha Raghunath

doi:10.1109/tai.2023.3244168

Open Access

https://doi.org/10.1109/tai.2023.3244168

Abstract

With the increasing prevalence of encrypted network traffic, cyber security analysts have been turning to machine learning (ML) techniques to elucidate the traffic on their networks. However, ML models can become stale as new traffic emerges that is outside the distribution of the training set. To adapt reliably in this dynamic environment, ML models must additionally provide contextualized uncertainty quantification for their predictions, which has received little attention in the cyber security domain. Uncertainty quantification is necessary to signal both when the model is uncertain about which class to choose in its label assignment and when the traffic is not likely to belong to any of the pre-trained classes. We present a new, public dataset of network traffic that includes labeled, Virtual Private Network (VPN)-encrypted network traffic generated by 10 applications and corresponding to 5 application categories. We also present an ML framework that is designed to rapidly train with modest data requirements and provide both calibrated predictive probabilities and an interpretable "out-of-distribution" (OOD) score to flag novel traffic samples. We describe calibrating OOD scores using p-values of the relative Mahalanobis distance. We demonstrate that our framework achieves an F1 score of 0.98 on our dataset and that it can extend to an enterprise network by testing the model: (1) on data from similar applications, (2) on dissimilar application traffic from an existing category, and (3) on application traffic from a new category. The model correctly flags uncertain traffic and, upon retraining, accurately incorporates the new data.
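The abstract does not spell out how the OOD score is computed, so the following is only a minimal sketch of one way the idea could look, not the authors' implementation. It assumes the commonly used formulation of the relative Mahalanobis distance (the distance to the nearest class Gaussian with a shared covariance, minus the distance to a single "background" Gaussian fitted to all training data) and assumes an empirical p-value taken against scores of held-out in-distribution samples. The function names `fit_rmd`, `rmd_score`, and `ood_p_value`, the synthetic two-class data, and the small ridge term are all hypothetical choices made for illustration.

```python
import numpy as np

def fit_rmd(X, y):
    """Fit class means, a shared precision matrix, and a background Gaussian."""
    classes = np.unique(y)
    means = np.stack([X[y == c].mean(axis=0) for c in classes])
    # Shared covariance: pool the class-centered features (assumption).
    centered = X - means[np.searchsorted(classes, y)]
    ridge = 1e-6 * np.eye(X.shape[1])  # hypothetical stabilizer
    shared_prec = np.linalg.inv(centered.T @ centered / len(X) + ridge)
    # Background Gaussian ignores the labels entirely.
    mu0 = X.mean(axis=0)
    bg_prec = np.linalg.inv(np.cov(X, rowvar=False, bias=True) + ridge)
    return means, shared_prec, mu0, bg_prec

def _sq_mahalanobis(X, mu, prec):
    diff = X - mu
    return np.einsum("ij,jk,ik->i", diff, prec, diff)

def rmd_score(X, params):
    """Smallest class distance minus background distance; larger = more novel."""
    means, shared_prec, mu0, bg_prec = params
    md_k = np.stack([_sq_mahalanobis(X, m, shared_prec) for m in means], axis=1)
    md_0 = _sq_mahalanobis(X, mu0, bg_prec)
    return (md_k - md_0[:, None]).min(axis=1)

def ood_p_value(scores, calib_scores):
    """Empirical p-value: share of in-distribution scores at least as large."""
    calib = np.sort(calib_scores)
    n_ge = len(calib) - np.searchsorted(calib, scores, side="left")
    return (n_ge + 1) / (len(calib) + 1)

# Synthetic stand-in for extracted traffic features (two known classes).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (200, 2)), rng.normal(6.0, 1.0, (200, 2))])
y = np.repeat([0, 1], 200)

params = fit_rmd(X[::2], y[::2])            # fit on one half
calib_scores = rmd_score(X[1::2], params)   # held-out in-distribution scores

queries = np.array([[0.0, 0.0], [30.0, 30.0]])  # familiar point, distant point
p_in, p_far = ood_p_value(rmd_score(queries, params), calib_scores)
print(p_in, p_far)
```

Under these assumptions, a small p-value means that few held-out in-distribution samples looked as unusual as the query, which is what makes the score readable as a flag for novel traffic. The threshold at which a sample is flagged is a separate design choice that this sketch does not address.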

Journal: IEEE Transactions on Artificial Intelligence
Publication Date: Jan 1, 2024
Citations: 11
License type: CC BY 4.0

Similar Papers

Development and evaluation of uncertainty quantifying machine learning models to predict piperacillin plasma concentrations in critically ill patients
Jarne Verhaeghe ... Pieter Colin
BMC Medical Informatics and Decision Making | VOL. 22
25 Aug 2022

Uncertainty quantification in machine learning for engineering design and health prognostics: A tutorial
Venkat Nemani ... Chao Hu
Mechanical Systems and Signal Processing | VOL. 205
19 Oct 2023

Machine Learning Framework to Identify Individuals at Risk of Rapid Progression of Coronary Atherosclerosis: From the PARADIGM Registry.
Donghee Han ...
Journal of the American Heart Association | VOL. 9
22 Feb 2020

Systematic literature review of machine learning based software development effort estimation models
Jianfeng Wen ... Changqin Huang
Information and Software Technology | VOL. 54
16 Sep 2011
