VPN and Non-VPN Network Traffic Classification Using Time-Related Features

Mustafa Al-Fayoumi,Shadi Nashwan,Mohammad Al-Fawa’Reh

doi:10.32604/cmc.2022.025103

Mustafa Al-Fayoumi, Shadi Nashwan + Show 1 more

Open Access

https://doi.org/10.32604/cmc.2022.025103

Copy DOI

Abstract

The continual growth of the use of technological appliances during the COVID-19 pandemic has resulted in a massive volume of data flow on the Internet, as many employees have transitioned to working from home. Furthermore, with the increase in the adoption of encrypted data transmission by many people who tend to use a Virtual Private Network (VPN) or Tor Browser (dark web) to keep their data privacy and hidden, network traffic encryption is rapidly becoming a universal approach. This affects and complicates the quality of service (QoS), traffic monitoring, and network security provided by Internet Service Providers (ISPs), particularly for analysis and anomaly detection approaches based on the network traffic’s nature. The method of categorizing encrypted traffic is one of the most challenging issues introduced by a VPN as a way to bypass censorship as well as gain access to geo-locked services. Therefore, an efficient approach is especially needed that enables the identification of encrypted network traffic data to extract and select valuable features which improve the quality of service and network management as well as to oversee the overall performance. In this paper, the classification of network traffic data in terms of VPN and non-VPN traffic is studied based on the efficiency of time-based features extracted from network packets. Therefore, this paper suggests two machine learning models that categorize network traffic into encrypted and non-encrypted traffic. The proposed models utilize statistical features (SF), Pearson Correlation (PC), and a Genetic Algorithm (GA), preprocessing the traffic samples into net flow traffic to accomplish the experiment’s objectives. The GA-based method utilizes a stochastic method based on natural genetics and biological evolution to extract essential features. The PC-based method performs well in removing different features of network traffic. With a microsecond per-packet prediction time, the best model achieved an accuracy of more than 95.02 percent in the most demanding traffic classification task, a drop in accuracy of only 2.37 percent in comparison to the entire statistical-based machine learning approach. This is extremely promising for the development of real-time traffic analyzers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers, Materials & Continua	Publication Date: Jan 1, 2022
Citations: 4	License type: cc-by

R Discovery Prime

R Discovery Prime

VPN and Non-VPN Network Traffic Classification Using Time-Related Features

Abstract

Talk to us

Similar Papers

More From: Computers, Materials & Continua

Lead the way for us

Similar Papers

Minimal cost design of virtual private networks
Huan Liang ... D Makrakis
-
Huan Liang, et. al. Huan Liang ... D Makrakis
07 Aug 2002
07 Aug 2002

Detection of VPN Network Traffic
Avnish Goel ... Prasad B Honnavali
-
Avnish Goel, et. al.Avnish Goel ... Prasad B Honnavali
11 Feb 2022
11 Feb 2022

QoS-Classifier for VPN and Non-VPN traffic based on time-related features
Julian Andres Caicedo-Muñoz ... Alvaro Rendón
Computer Networks | VOL. 144
Julian Andres Caicedo-Muñoz, et. al.Julian Andres Caicedo-Muñoz ... Alvaro Rendón
13 Aug 2018
Computer Networks | VOL. 144

VPN Analysis and New Perspective for Securing Voice over VPN Networks
Wafaa Bou Diab ... Carole Bassil
-
Wafaa Bou Diab, et. al.Wafaa Bou Diab ... Carole Bassil
01 Mar 2008
01 Mar 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

VPN and Non-VPN Network Traffic Classification Using Time-Related Features

Abstract

Talk to us

Similar Papers

More From: Computers, Materials &amp; Continua

More From: Computers, Materials & Continua