Abstract

Network traffic classification based on machine learning is an important branch of pattern recognition in computer science. It is a key technology for dynamic intelligent network management and enhanced network controllability. However, the traffic classification methods still facing severe challenges: The optimal set of features is difficult to determine. The classification method is highly dependent on the effective characteristic combination. Meanwhile, it is also important to balance the experience risk and generalization ability of the classifier. In this paper, an improved network traffic classification model based on a support vector machine is proposed. First, a filter-wrapper hybrid feature selection method is proposed to solve the false deletion of combined features caused by a traditional feature selection method. Second, to balance the empirical risk and generalization ability of support vector machine (SVM) traffic classification model, an improved parameter optimization algorithm is proposed. The algorithm can dynamically adjust the quadratic search area, reduce the density of quadratic mesh generation, improve the search efficiency of the algorithm, and prevent the over-fitting while optimizing the parameters. The experiments show that the improved traffic classification model achieves higher classification accuracy, lower dimension and shorter elapsed time and performs significantly better than traditional SVM and the other three typical supervised ML algorithms.

Highlights

  • With the dramatic growth in network applications, the current network has become a large, dynamic and complex system

  • We compare the proposed model with the traditional support vector machine (SVM), and the representative supervised machine learning algorithm. It shows that traffic classification performance can be significantly improved by the proposed model by using very few training samples

  • We proposed an improved grid search algorithm, the algorithm gets the best combination of the key parameter of SVM classifier to improve the classification accuracy and prevent overfitting

Read more

Summary

Introduction

With the dramatic growth in network applications, the current network has become a large, dynamic and complex system. The traffic classification method based on machine learning faces some challenges: (1) The optimal feature subset is difficult to obtain. We focus on reducing the dimension of the flow features and deriving the optimal working parameters based on the machine learning model. Traffic classification model, improve its classification and generalization ability, and improved the grid search parameter optimization algorithm is proposed. We compare the proposed model with the traditional SVM, and the representative supervised machine learning algorithm. It shows that traffic classification performance can be significantly improved by the proposed model by using very few training samples.

Method Based on Port Numbers
Method Based on Deep Packet Inspection
Method Based on Protocol Analysis
Method Based on Machine Learning Techniques
Model Framework
The Filter-Wrapper Feature Selection Algorithm
Filter-Wrapper Feature Selection Model Framework
Filter-Wrapper Feature Selection Algorithm Framework
Filter Feature Selection Algorithm Evaluation Strategy
Wrapper Feature Selection Algorithm Search Strategy
Basic Principle of Traditional Grid Search Algorithm
Improved Grid Search Algorithm Framework
16. End if
IGS Evaluation Strategy
Datasets
Performance Analysis of Filter-Wrapper Feature Selection
Initialization Parameter Setting
Second-Grid Parameter Optimization
Comparison and Analysis of Each Stage of the IGS Algorithm
Comparison and Analysis of IGS and Other Algorithms
Comparison and Analysis of IGS on CAIDA Datasets
Comparison of IGS and Other Related Approaches
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call