Unknown Traffic Recognition Based on Multi-Feature Fusion and Incremental Learning

Junyi Liu,Tian Yan,Jiarong Wang,Gang Chen,Fazhi Qi

doi:10.3390/app13137649

Junyi Liu, Tian Yan + Show 3 more

Open Access

PDF Available

https://doi.org/10.3390/app13137649

Copy DOI

Export

Save

Cite

Journal: Applied Sciences	Publication Date: Jun 28, 2023
Citations: 3	License type: CC BY 4.0

Affiliation: Institute of High Energy Physics

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Accurate classification and identification of Internet traffic are crucial for maintaining network security. However, unknown network traffic in the real world can affect the accuracy of current machine learning models, reducing the efficiency of traffic classification. Existing unknown traffic classification algorithms are unable to optimize traffic features and require the entire system to be retrained each time new traffic data are collected. This results in low recognition efficiency, making the algoritms unsuitable for real-time application detection. To solve the above issues, we suggest a multi-feature fusion-based incremental technique for detecting unknown traffic in this paper. The approach employs a multiple-channel parallel architecture to extract temporal and spatial traffic features. It then uses the mRMR algorithm to rank and fuse the features extracted from each channel to overcome the issue of redundant encrypted traffic features. In addition, we combine the density-ratio-based clustering algorithm to identify the unknown traffic features and update the model via incremental learning. The cassifier enables real-time classification of known and unknown traffic by learning newly acquired class knowledge. Our model can identify encrypted unknown Internet traffic with at least 86% accuracy in various scenarios, using the public ISCX-VPN-Tor datasets. Furthermore, it achieves 90% accuracy on the intrusion detection dataset NSL-KDD. In our self-collected dataset from a real-world environment, the accuracy of our model exceeds 96%. This work offers a novel method for identifying unknown network traffic, contributing to the security preservation of network environments.

Full Text