ExtendedSketch: Fusing Network Traffic for Super Host Identification With a Memory Efficient Sketch

Xuyang Jing,Zheng Yan,Hui Han,Witold Pedrycz

doi:10.1109/tdsc.2021.3111328

Xuyang Jing, Zheng Yan + Show 2 more

Open Access

PDF Available

https://doi.org/10.1109/tdsc.2021.3111328

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Super host refers to the host that has a high cardinality or exhibits a big change in a network. Facing big-volume network traffic, sketches have been widely applied to identify super hosts in an efficient and accurate way. However, most sketches cannot flexibly balance memory usage and accuracy in host cardinality estimation. Setting an inappropriate counter size for a sketch could either lead to inaccurate host cardinality estimation or cause memory waste. In order to solve this issue, we propose a novel extensible and reversible sketch, named ExtendedSketch, to achieve accurate super host identification with high memory efficiency. The core idea of ExtendedSketch is to monitor low-cardinality hosts with small-sized counters while dynamically extending the size of counters when monitoring high-cardinality hosts by applying an adaptive extension strategy. Such the strategy can adaptively increase counter size according to network traffic status at runtime, which not only ensures the accuracy of high-cardinality host estimation but also avoids unnecessary memory consumption. We perform theoretical analysis and conduct a series of experimental evaluations on ExtendedSketch based on real world network traffic. Experimental results show that under same memory usage, compared to the state-of-the-art, ExtendedSketch achieves <inline-formula><tex-math notation="LaTeX">$1.4{ \sim }7.5$</tex-math></inline-formula> times smaller error rate in estimating host cardinality with <inline-formula><tex-math notation="LaTeX">$1.9{ \sim }26.7$</tex-math></inline-formula> times better accuracy on super host identification and <inline-formula><tex-math notation="LaTeX">$95 {\sim }2^{15}$</tex-math></inline-formula> times faster speed on abnormal address reconstruction. Its advance in accuracy and efficiency demonstrates the practical significance of ExtendedSketch for super host identification.

Highlights

S UPER host identification plays an important role in network management, which can be applied to detect network attacks (e.g., Distributed Denial of Service (DDoS) attacks [1], network scanning [2]), track hot-spot web content [3], monitor user activities [4]
We give an overview of ExtendedSketch. It is a novel data structure for super host identification originally proposed in this paper: its structure is adaptively adjusted based on the distribution of host cardinality, which is totally different from any existing sketches
We proposed ExtendedSketch, a novel extensible and reversible data structure, for host cardinality estimation

Summary

INTRODUCTION

S UPER host identification plays an important role in network management, which can be applied to detect network attacks (e.g., Distributed Denial of Service (DDoS) attacks [1], network scanning [2]), track hot-spot web content [3], monitor user activities [4]. We propose a novel extensible and reversible sketch to solve the above challenges about identifying super hosts facing skewed network traffic, named ExtendedSketch. The extensibility makes ExtendedSketch feasible to be applied into analyzing skewed big-volume traffic with high memory efficiency and super host identification accuracy. (1) We propose an extensible, mergeable and reversible sketch to estimate host cardinality and further identify super hosts, named ExtendedSketch It can achieve memory efficiency, fast traffic processing, and accurate reconstruction of addresses of super hosts at the same time. It is a novel data structure for super host identification originally proposed in this paper: its structure is adaptively adjusted based on the distribution of host cardinality, which is totally different from any existing sketches.

Method

OVERVIEW OF EXTENDEDSKETCH

The Structure of ExtendedSketch

Update Operation

Estimation Operation

Merge Operation

Reversible Calculation Operation

Super Host Identification

Super Spreader Identification

Super Changer Identification

THEORETICAL ANALYSIS

Analysis on Space and Time Complexities

Analysis of Estimation Operation of ExtendedSketch

EXPERIMENTAL STUDY AND PERFORMANCE EVALUATION

Datasets Description

Comparative Analysis

Host Cardinality Estimation

Effectiveness

Methods

Memory Efficiency

Findings

CONCLUSIONS

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Dependable and Secure Computing	Publication Date: Nov 1, 2022
Citations: 15	License type: CC BY 4.0

R Discovery Prime

ExtendedSketch: Fusing Network Traffic for Super Host Identification With a Memory Efficient Sketch

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing

Lead the way for us

Similar Papers

ExtendedSketch+: Super host identification and network host trust evaluation with memory efficiency and high accuracy
Hui Han ... Witold Pedrycz
Information Fusion | VOL. 92
Hui Han, et. al.Hui Han ... Witold Pedrycz
13 Dec 2022
Information Fusion | VOL. 92

Development of One-to-One Shortest Path Algorithm Based on Link Flow Speeds on Urban Networks
Taehyeong Kim ... Bum-Jin Park
The Journal of The Korea Institute of Intelligent Transport Systems | VOL. 11
Taehyeong Kim, et. al.Taehyeong Kim ... Bum-Jin Park
30 Oct 2012
The Journal of The Korea Institute of Intelligent Transport Systems | VOL. 11

Real-time traffic jams prediction inspired by Biham, Middleton and Levine (BML) model
Wenbin Hu ... Dacheng Tao
Information Sciences | VOL. 381
Wenbin Hu, et. al.Wenbin Hu ... Dacheng Tao
28 Nov 2016
Information Sciences | VOL. 381

Spatial and Temporal Analysis of Congestion in Urban Transportation Networks

-

01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

ExtendedSketch: Fusing Network Traffic for Super Host Identification With a Memory Efficient Sketch

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing