Internet Traffic Classification Using Machine Learning

Li Jun Li Jun,Zhang Zailong,Lu Yanqing Lu Yanqing,Zhang Shunyi

doi:10.1109/chinacom.2007.4469372

Abstract

Internet traffic identification and classification is vital to the areas of network management and security monitoring, network planning, and QoS provision. Traditional approaches such as port-based and payload-based identification are becoming increasingly difficult with many new applications (e.g. P2P) using dynamic port numbers, masquerading techniques, and encryption to avoid detection. An alternative approach is to classify traffic by exploiting the distinctive characteristics of flow statistics. We present here a traffic classification scheme based on machine learning (ML). The performance impact of the dataset size, feature selection and ML algorithm selection is demonstrated by experiments. The genetic algorithm based feature selection can dramatically reduce the ML learning and modeling time with less decrease or even a bit increase in classification accuracy. The chosen ML algorithms: TAN, C4.5, NBTree, RandomForest and distance weighted KNN, can reach high classification accuracy. Typically, C4.5 and RandomForest are superior to other ML algorithms in computational complexity. Besides, experiments show that the size of data set would impact on the classification performance, and tuning dataset's size could meet the requirements of specific applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Internet Traffic Classification Using Machine Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria.
Robson P Bonidia ... Anderson P Avila Santos
Briefings in Bioinformatics | VOL. 23
Robson P Bonidia, et. al.Robson P Bonidia ... Anderson P Avila Santos
27 Jun 2022
Briefings in Bioinformatics | VOL. 23

Hybrid meta-heuristic and machine learning algorithms for tunneling-induced settlement prediction: A comparative study
Pin Zhang ... Tommy H.T Chan
Tunnelling and Underground Space Technology | VOL. 99
Pin Zhang, et. al.Pin Zhang ... Tommy H.T Chan
20 Mar 2020
Tunnelling and Underground Space Technology | VOL. 99

BioAutoML: Democratizing Machine Learning in Life Sciences
Robson Parmezan Bonidia ... Carvalho
-
Robson Parmezan Bonidia, et. al.Robson Parmezan Bonidia ... Carvalho
25 Jun 2024
25 Jun 2024

6 - Machine learning for biomedical signal analysis
Sri Krishnan
Biomedical Signal Analysis for Connected Healthcare | VOL. -
Sri KrishnanSri Krishnan
01 Jan 2020
Biomedical Signal Analysis for Connected Healthcare | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Internet Traffic Classification Using Machine Learning

Abstract

Talk to us

Similar Papers