In recent years, Internet traffic classification using machine learning has become new direction in network measurement. In this method, choose the appropriate traffic features is the key. The selection of feature in previous studies dependent on the specific data set and does not have the versatility to identify the data sets captured in the actual network conditions. We analyze and select a group of features based on public data set and the data collected in the actual network. Experimental results show that the selected feature set with stable performance and effective identification ability by using C4.5 decision tree method.
Read full abstract