Toward Bulk Synchronous Parallel-Based Machine Learning Techniques for Anomaly Detection in High-Speed Big Data Networks

Kamran Siddique, Woongsup Kim, Zahid Akhtar, Yangwoo Kim, Haeng-Gon Lee

doi:10.3390/sym9090197

Abstract

Anomaly detection systems, also known as intrusion detection systems (IDSs), continuously monitor network traffic to identify malicious actions. Extensive research has been conducted to build efficient IDSs, emphasizing two essential characteristics. The first is concerned with finding an optimal feature selection, while the second deals with employing robust classification schemes. However, the advent of big data concepts in the anomaly detection domain and the appearance of sophisticated network attacks in the modern era require some fundamental methodological revisions to the way IDSs are developed. Therefore, we first identify two more significant characteristics in addition to the ones mentioned above. These refer to the need for employing specialized big data processing frameworks and utilizing appropriate datasets for validating the system's performance, both of which are largely overlooked in existing studies. Afterwards, we set out to develop an anomaly detection system that comprehensively follows these four identified characteristics, i.e., the proposed system (i) performs feature ranking and selection using information gain and automated branch-and-bound algorithms, respectively; (ii) employs logistic regression and extreme gradient boosting techniques for classification; (iii) introduces bulk synchronous parallel processing to cater to the computational requirements of high-speed big data networks; and (iv) uses the real-time contemporary dataset of the Information Security Centre of Excellence (ISCX) of the University of New Brunswick for performance evaluation. We present experimental results that verify the efficacy of the proposed system.
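
As a rough illustration of the feature-ranking step in (i), the sketch below computes information gain for discrete features and sorts them by that score. It is a minimal example under stated assumptions, not the authors' implementation: the function names (`entropy`, `information_gain`, `rank_features`) and the toy flow records are hypothetical, the data are not drawn from the ISCX dataset, and continuous features would first need to be discretized.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their expected entropy after splitting on the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(columns, labels):
    """Return (feature name, gain) pairs sorted by decreasing information gain."""
    gains = {name: information_gain(values, labels) for name, values in columns.items()}
    return sorted(gains.items(), key=lambda item: item[1], reverse=True)


# Toy, made-up flow records for illustration only (hypothetical feature names).
toy_columns = {
    "protocol": ["tcp", "tcp", "udp", "udp"],
    "flag": ["syn", "ack", "syn", "ack"],
}
toy_labels = ["attack", "attack", "normal", "normal"]

ranking = rank_features(toy_columns, toy_labels)
for name, gain in ranking:
    print(f"{name}: {gain:.3f}")
```

In this toy data the `protocol` column separates the two classes perfectly while `flag` carries no information about the label, so a ranking of this kind would place `protocol` first. A subsequent selection step, such as the branch-and-bound search mentioned in the abstract, would then choose a subset from the ranked features.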

Highlights

This decade has witnessed tremendous growth in cyberspace and various computing devices. The proliferation of the Internet with these computing devices has enhanced efficiency and productivity in almost all dimensions of life.
The advances in high-speed big ... Anomaly detection is a significant issue in computer networks.
The other two characteristics combat the challenges introduced by large-scale networks and sophisticated network attacks, namely utilizing specialized big data computing engines and obtaining contemporary workloads to conduct performance evaluations of the proposed systems.

Summary

Introduction

This decade has witnessed tremendous growth in cyberspace and various computing devices. In recent years, anomaly detection based on machine learning and data mining techniques has received considerable attention among researchers. Two important aspects hinder the progress of network intrusion detection system (NIDS) research and greatly need the attention of the IDS research community: the selection of an appropriate big data computing framework and the use of adequate datasets for evaluating an IDS. We emphasize that the value and legitimacy of these decisions are as important as the other fundamental characteristics in the process of developing efficient IDSs. Building on the points addressed so far, we introduce a comprehensive IDS incorporating bulk synchronous parallel

Background and Related Work

Utilizing Machine Learning and Bulk Synchronous Parallel Computing Techniques

Proposed Framework

Data Preprocessing

Feature Ranking and Selection

Attack Recognition

Dataset and Experimental Setup

Performance Evaluation

Results and Discussion

Conclusions

Journal: Symmetry
Publication Date: Sep 19, 2017
Citations: 18
License type: CC BY 4.0
