Conditional heavy hitters: detecting interesting correlations in data streams

Katsiaryna Mirylenka, Themis Palpanas, Graham Cormode, Divesh Srivastava

doi:10.1007/s00778-015-0382-5

Abstract

The notion of heavy hitters (items that make up a large fraction of the population) has been successfully used in a variety of applications across sensor and RFID monitoring, network data analysis, event mining, and more. Yet this notion often fails to capture the semantics we desire when we observe data in the form of correlated pairs. Here, we are interested in items that are conditionally frequent: that is, a particular item is frequent within the context of its parent item. In this work, we introduce and formalize the notion of conditional heavy hitters to identify such items, with applications in network monitoring and Markov chain modeling. We explore the relationship between conditional heavy hitters and other related notions in the literature, and show analytically and experimentally the usefulness of our approach. We introduce several algorithm variations that allow us to efficiently find conditional heavy hitters for input data with very different characteristics, and provide analytical results for their performance. Finally, we perform experimental evaluations with several synthetic and real datasets to demonstrate the efficacy of our methods and to study the behavior of the proposed algorithms for different types of data.
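To make the idea of a conditionally frequent item concrete, the following is a minimal sketch of an exact, non-streaming baseline. It is not one of the algorithms from the paper, and the abstract does not give the formal definition: the sketch assumes that a (parent, child) pair qualifies when count(parent, child) / count(parent) is at least a threshold. The function name `conditional_heavy_hitters`, the parameter `phi`, and the sample `stream` are all hypothetical, chosen only for illustration.

```python
from collections import Counter


def conditional_heavy_hitters(pairs, phi):
    """Exact baseline (assumption, not the paper's method): return the
    (parent, child) pairs whose conditional frequency
    count(parent, child) / count(parent) is at least phi."""
    pair_counts = Counter()
    parent_counts = Counter()
    for parent, child in pairs:
        pair_counts[(parent, child)] += 1
        parent_counts[parent] += 1

    result = {}
    for (parent, child), count in pair_counts.items():
        conditional_freq = count / parent_counts[parent]
        if conditional_freq >= phi:
            result[(parent, child)] = conditional_freq
    return result


# Hypothetical stream of (parent, child) pairs.
# ("a", "x") is the most frequent pair overall, but only 60% of a's children are x.
# ("b", "z") is rare overall, but z always follows b.
stream = [("a", "x")] * 6 + [("a", "y")] * 4 + [("b", "z")] * 2

print(conditional_heavy_hitters(stream, phi=0.9))
```

In this toy stream the globally most frequent pair is not reported at `phi=0.9`, while the rare but perfectly predictable pair is, which is the contrast with plain heavy hitters that the abstract describes. The baseline keeps a counter for every distinct pair, so it only serves to illustrate the definition; a streaming setting would need bounded-memory summaries instead.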

Journal: The VLDB Journal
Publication Date: Feb 26, 2015
Citations: 60

Similar Papers

Finding interesting correlations with conditional heavy hitters
K Mirylenka ... G Cormode
01 Apr 2013

Communication-efficient algorithms for tracking distributed data streams
Qin Zhang
23 Dec 2014

Real-time Spread Burst Detection in Data Streaming
Haibo Wang ... Dimitrios Melissourgos
Proceedings of the ACM on Measurement and Analysis of Computing Systems | VOL. 7
19 May 2023

Development and Evaluation of Stochastic Rainfall Models for Urban Drought Security Assessment
Afm Kamal Chowdhury
SSRN Electronic Journal
01 Jan 2015
