The prediction analysis is the approach of data mining which is applied to predict future possibilities based on the current information. The network traffic classification is the major issue of the prediction analysis due to complex dataset. The network traffic techniques have three steps, which are pre-processing, feature extraction and classification. In the phase of pre-processing data set is collected which is processed to removed missing and redundant values. In the second phase, the relationship between attribute and target set is established. In the last phase, the technique of classification is applied for the classification. This research study has been influenced by the different intrusion threats on internet and the ways to detect them. In this research, we have studied and analyzed the famous network traffic data -NSL KDD dataset and its various features. The proposed model is a hybrid of Logistic Regression and K-nearest neighbor classifier combined using voting classifier, which aims at classifying the data into malicious and non-malicious with more accuracy than existing methods.
Read full abstract